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Recommended Books and Resources 


There is more or less a well established route to teaching electromagnetism. A number 
of good books follow this. 


e David J. Griffiths, “Introduction to Electrodynamics” 


A superb book. The explanations are clear and simple. It doesn’t cover quite as much 
as we'll need for these lectures, but if you’re looking for a book to cover the basics then 
this is the first one to look at. 


e Edward M. Purcell and David J. Morin “Electricity and Magnetism” 


Another excellent book to start with. It has somewhat more detail in places than 
Griffiths, but the beginning of the book explains both electromagnetism and vector 
calculus in an intertwined fashion. If you need some help with vector calculus basics, 
this would be a good place to turn. If not, you’ll need to spend some time disentangling 
the two topics. 


e J. David Jackson, “Classical Electrodynamics” 


The most canonical of physics textbooks. This is probably the one book you can find 
on every professional physicist’s shelf, whether string theorist or biophysicist. It will 
see you through this course and next year’s course. The problems are famously hard. 
But it does have div, grad and curl in polar coordinates on the inside cover. 


e A. Zangwill, “Modern Electrodynamics” 
A great book. It is essentially a more modern and more friendly version of Jackson. 
e Feynman, Leighton and Sands, “The Feynman Lectures on Physics, Volume IT” 


Feynman’s famous lectures on physics are something of a mixed bag. Some explanations 
are wonderfully original, but others can be a little too slick to be helpful. And much of 
the material comes across as old-fashioned. Volume two covers electromagnetism and, 
in my opinion, is the best of the three. 


A number of excellent lecture notes, including the Feynman lectures, are available 
on the web. Links can be found on the course webpage: 
http: //www.damtp.cam.ac.uk/user/tong/em.htm] 
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These lecture notes contain material covering two courses on Electromagnetism. 
In Cambridge, these courses are called Part IB Electromagnetism and Part II Electro- 
dynamics. The notes owe a debt to the previous lecturers of these courses, including 
Natasha Berloff, John Papaloizou and especially Anthony Challinor. 


The notes assume a familiarity with Newtonian mechanics and special relativity, 
as covered in the Dynamics and Relativity notes. They also assume a knowledge of 
Vector Calculus. The notes do not cover the classical field theory (Lagrangian and 
Hamiltonian) section of the Part II course. 


1. Introduction 


There are, to the best of our knowledge, four forces at play in the Universe. At the very 
largest scales — those of planets or stars or galaxies — the force of gravity dominates. 
At the very smallest distances, the two nuclear forces hold sway. For everything in 
between, it is force of electromagnetism that rules. 


At the atomic scale, electromagnetism (admittedly in conjunction with some basic 
quantum effects) governs the interactions between atoms and molecules. It is the force 
that underlies the periodic table of elements, giving rise to all of chemistry and, through 
this, much of biology. It is the force which binds atoms together into solids and liquids. 
And it is the force which is responsible for the incredible range of properties that 
different materials exhibit. 


At the macroscopic scale, electromagnetism manifests itself in the familiar phenom- 
ena that give the force its name. In the case of electricity, this means everything from 
rubbing a balloon on your head and sticking it on the wall, through to the fact that you 
can plug any appliance into the wall and be pretty confident that it will work. For mag- 
netism, this means everything from the shopping list stuck to your fridge door, through 
to trains in Japan which levitate above the rail. Harnessing these powers through the 
invention of the electric dynamo and motor has transformed the planet and our lives 
on it. 


As if this wasn’t enough, there is much more to the force of electromagnetism for it 
is, quite literally, responsible for everything you’ve ever seen. It is the force that gives 
rise to light itself. 


Rather remarkably, a full description of the force of electromagnetism is contained in 
four simple and elegant equations. These are known as the Maxwell equations. There 
are few places in physics, or indeed in any other subject, where such a richly diverse 
set of phenomena flows from so little. The purpose of this course is to introduce the 
Maxwell equations and to extract some of the many stories they contain. 


However, there is also a second theme that runs through this course. The force of 
electromagnetism turns out to be a blueprint for all the other forces. There are various 
mathematical symmetries and structures lurking within the Maxwell equations, struc- 
tures which Nature then repeats in other contexts. Understanding the mathematical 
beauty of the equations will allow us to see some of the principles that underly the laws 
of physics, laying the groundwork for future study of the other forces. 


1.1 Charge and Current 


Each particle in the Universe carries with it a number of properties. These determine 
how the particle interacts with each of the four forces. For the force of gravity, this 
property is mass. For the force of electromagnetism, the property is called electric 
charge. 


For the purposes of this course, we can think of electric charge as a real number, 
q E R. Importantly, charge can be positive or negative. It can also be zero, in which 
case the particle is unaffected by the force of electromagnetism. 


The SI unit of charge is the Coulomb, denoted by C. It is, like all SI units, a parochial 
measure, convenient for human activity rather than informed by the underlying laws 
of the physics. (We’ll learn more about how the Coulomb is defined in Section 3.5). 
At a fundamental level, Nature provides us with a better unit of charge. This follows 
from the fact that charge is quantised: the charge of any particle is an integer multiple 
of the charge carried by the electron which we denoted as —e, with 


e = 1.602176634 x 10°" C 


A much more natural unit would be to simply count charge as q = ne with n € Z. 
Then electrons have charge —1 while protons have charge +1 and neutrons have charge 
0. Nonetheless, in this course, we will bow to convention and stick with SI units. 


(An aside: the charge of quarks is actually q = —e/3 and q = 2e/3. This doesn’t 
change the spirit of the above discussion since we could just change the basic unit. But, 
apart from in extreme circumstances, quarks are confined inside protons and neutrons 
so we rarely have to worry about this). 


One of the key goals of this course is to move beyond the dynamics of point particles 
and onto the dynamics of continuous objects known as fields. To aid in this, it’s useful 
to consider the charge density, 


p(x, t) 


defined as charge per unit volume. The total charge Q in a given region V is simply 
Q= fy dx p(x, t). In most situations, we will consider smooth charge densities, which 
can be thought of as arising from averaging over many point-like particles. But, on 
occasion, we will return to the idea of a single particle of charge q, moving on some 
trajectory r(t), by writing p = qô(x — r(t)) where the delta-function ensures that all 
the charge sits at a point. 


More generally, we will need to describe the movement of charge from one place to 
another. This is captured by a quantity known as the current density J(x,t), defined 
as follows: for every surface S, the integral 


r= | 3-48 
S 


counts the charge per unit time passing through S. (Here dS is the unit normal to 
S). The quantity I is called the current. In this sense, the current density is the 
current-per-unit-area. 


The above is a rather indirect definition of the cur- 
rent density. To get a more intuitive picture, consider a 
continuous charge distribution in which the velocity of a 
small volume, at point x, is given by v(x,t). Then, ne- 
glecting relativistic effects, the current density is 


J = pv 


In particular, if a single particle is moving with velocity 
v = r(t), the current density will be J = qvô?(x — r(t)). 
This is illustrated in the figure, where the underlying charged particles are shown as 


Figure 1: Current flux 


red balls, moving through the blue surface S. 


As a simple example, consider electrons mov- 


ing along a wire. We model the wire as a long © » e © bg 
cylinder of cross-sectional area A as shown be- é < ind ‘ 
low. The electrons move with velocity v, paral- e e © 


lel to the axis of the wire. (In reality, the elec- 
trons will have some distribution of speeds; we ; 
take v to be their average velocity). If there are Pipure A Tiew 
n electrons per unit volume, each with charge 
q, then the charge density is p = nq and the current density is J = ngv. The current 


itself is J = |J|A. 


Throughout this course, the current density J plays a much more prominent role 
than the current J. For this reason, we will often refer to J simply as the “current” 
although we’ll be more careful with the terminology when there is any possibility for 
confusion. 


1.1.1 The Conservation Law 


The most important property of electric charge is that it’s conserved. This, of course, 
means that the total charge in a system can’t change. But it means much more than 
that because electric charge is conserved locally. An electric charge can’t just vanish 
from one part of the Universe and turn up somewhere else. It can only leave one point 
in space by moving to a neighbouring point. 


The property of local conservation means that p can change in time only if there 
is a compensating current flowing into or out of that region. We express this in the 
continuity equation, 

Op 
—+V-J=0 1.1 
Ji (1.1) 
This is an important equation. It arises in any situation where there is some quantity 
that is locally conserved. 


To see why the continuity equation captures the right physics, it’s best to consider 
the change in the total charge Q contained in some region V. 


dd de f day: I=- | 3-as 


From our previous discussion, f gJ- dS is the total current flowing out through the 
boundary S of the region V. (It is the total charge flowing out, rather than in, because 
dS is the outward normal to the region V). The minus sign is there to ensure that if 
the net flow of current is outwards, then the total charge decreases. 


If there is no current flowing out of the region, then dQ/dt = 0. This is the statement 
of (global) conservation of charge. In many applications we will take V to be all of 
space, R, with both charges and currents localised in some compact region. This 
ensures that the total charge remains constant. 


1.2 Forces and Fields 


Any particle that carries electric charge experiences the force of electromagnetism. But 
the force does not act directly between particles. Instead, Nature chose to introduce 
intermediaries. These are fields. 


In physics, a “field” is a dynamical quantity which takes a value at every point in 
space and time. To describe the force of electromagnetism, we need to introduce two 


fields, each of which is a three-dimensional vector. They are called the electric field E 
and the magnetic field B, 


E(x,t) and B(x,t) 


When we talk about a “force” in modern physics, we really mean an intricate interplay 
between particles and fields. There are two aspects to this. First, the charged particles 
create both electric and magnetic fields. Second, the electric and magnetic fields guide 
the charged particles, telling them how to move. This motion, in turn, changes the 
fields that the particles create. We’re left with a beautiful dance with the particles and 
fields as two partners, each dictating the moves of the other. 


This dance between particles and fields provides a paradigm which all other forces in 
Nature follow. It feels like there should be a deep reason that Nature chose to introduce 
fields associated to all the forces. And, indeed, this approach does provide one over- 
riding advantage: all interactions are local. Any object — whether particle or field — 
affects things only in its immediate neighbourhood. This influence can then propagate 
through the field to reach another point in space, but it does not do so instantaneously. 
It takes time for a particle in one part of space to influence a particle elsewhere. This 
lack of instantaneous interaction allows us to introduce forces which are compatible 
with the theory of special relativity, something that we will explore in more detail in 
Section 5 


The purpose of this course is to provide a mathematical description of the interplay 
between particles and electromagnetic fields. In fact, you’ve already met one side of 
this dance: the position r(t) of a particle of charge q is dictated by the electric and 
magnetic fields through the Lorentz force law, 


F =q(E+?xB) (1.2) 


The motion of the particle can then be determined through Newton’s equation F = mr. 
We explored various solutions to this in the Dynamics and Relativity course. Roughly 
speaking, an electric field accelerates a particle in the direction E, while a magnetic 
field causes a particle to move in circles in the plane perpendicular to B. 


We can also write the Lorentz force law in terms of the charge distribution p(x, t) 
and the current density J(x,t). Now we talk in terms of the force density f(x, t), which 
is the force acting on a small volume at point x. Now the Lorentz force law reads 


f=pE+JxB (1.3) 


1.2.1 The Maxwell Equations 


In this course, most of our attention will focus on the other side of the dance: the way 
in which electric and magnetic fields are created by charged particles. This is described 
by a set of four equations, known collectively as the Maxwell equations. They are: 


Via (1.4) 
€0 
V-B=0 (1.5) 
OB 
OE 
VxB- Hoos = Lod (1.7) 


The equations involve two constants. The first is the electric constant (known also, in 
slightly old-fashioned terminology, as the permittivity of free space), 


co © 8.85 x 10° m~? Kg"! s? O? 


It can be thought of as characterising the strength of the electric interactions. The 
other is the magnetic constant (or permeability of free space), 


lo = 4r x 1077 m Kg C7’? 
~ 1.25 x 10° m Kg C7? 


The presence of 47 in this formula isn’t telling us anything deep about Nature, but 
simply reflects a rather outdated way in which this constant was first defined. (We 
will explain this in more detail in Section 3.5). Nonetheless, this can be thought of as 
characterising the strength of magnetic interactions (in units of Coulombs). 


The Maxwell equations (1.4), (1.5), (1.6) and (1.7) will occupy us for the rest of the 
course. Rather than trying to understand all the equations at once, we'll proceed bit 
by bit, looking at situations where only some of the equations are important. By the 
end of the lectures, we will understand the physics captured by each of these equations 
and how they fit together. 


However, equally importantly, we will also explore the mathematical structure of the 
Maxwell equations. At first glance, they look just like four random equations from 
vector calculus. Yet this couldn’t be further from the truth. The Maxwell equations 
are special and, when viewed in the right way, are the essentially unique equations 
that can describe the force of electromagnetism. The full story of why these are the 
unique equations involves both quantum mechanics and relativity and will only be told 
in later courses. But we will start that journey here. The goal is that by the end of 
these lectures you will be convinced of the importance of the Maxwell equations on 
both experimental and aesthetic grounds. 


2. Electrostatics 


In this section, we will be interested in electric charges at rest. This means that there 
exists a frame of reference in which there are no currents; only stationary charges. Of 
course, there will be forces between these charges but we will assume that the charges 
are pinned in place and cannot move. The question that we want to answer is: what 
is the electric field generated by these charges? 


Since nothing moves, we are looking for time independent solutions to Maxwell’s 
equations with J = 0. This means that we can consistently set B = 0 and we’re left 
with two of Maxwell’s equations to solve. They are 


TEs- (2.1) 
E0 
and 
VxE=0 (2.2) 


If you fix the charge distribution p, equations (2.1) and (2.2) have a unique solution. 
Our goal in this section is to find it. 


2.1 Gauss’ Law 


Before we proceed, let’s first present equation (2.1) in a slightly different form that 
will shed some light on its meaning. Consider some closed region V C R? of space. 
We’ll denote the boundary of V by S = OV. We now integrate both sides of (2.1) over 
V. Since the left-hand side is a total derivative, we can use the divergence theorem to 
convert this to an integral over the surface S. We have 


1 
[e@cv-E= | Bas = dx p 
V S €0 Jv 


The integral of the charge density over V is simply the total charge contained in the 
region. We’ll call it Q = f dx p. Meanwhile, the integral of the electric field over S' is 
called the flux through S. We learn that the two are related by 


fe as-2 (2.3) 
5 


EQ 


This is Gauss’s law. However, because the two are entirely equivalent, we also refer to 
the original (2.1) as Gauss’s law. 


PKD 
\ 


Figure 3: The flux through S and S’ is Figure 4: The flux through S vanishes. 
the same. 


Notice that it doesn’t matter what shape the surface S takes. As long as it surrounds 
a total charge Q, the flux through the surface will always be Q/e . This is shown, for 
example, in the left-hand figure above. A fancy way of saying this is that the integral of 
the flux doesn’t depend on the geometry of the surface, but does depend on its topology 
since it must surround the charge Q. The choice of S is called the Gaussian surface; 
often there’s a smart choice that makes a particular problem simple. 


Only charges that lie inside V contribute to the flux. Any charges that lie outside will 
produce an electric field that penetrates through S at some point, giving negative flux, 
but leaves through the other side of S, depositing positive flux. The total contribution 
from these charges that lie outside of V is zero, as illustrated in the right-hand figure 
above. 


For a general charge distribution, we’ll need to use both Gauss’ law (2.1) and the 
extra equation (2.2). However, for rather special charge distributions — typically those 
with lots of symmetry — it turns out to be sufficient to solve the integral form of Gauss’ 
law (2.3) alone, with the symmetry ensuring that (2.2) is automatically satisfied. We 
start by describing these rather simple solutions. We’ll then return to the general case 
in Section 2.2. 


2.1.1 The Coulomb Force 


We'll start by showing that Gauss’ law (2.3) reproduces the more familiar Coulomb 
force law that we all know and love. To do this, take a spherically symmetric charge 
distribution, centered at the origin, contained within some radius R. This will be our 
model for a particle. We won’t need to make any assumption about the nature of the 
distribution other than its symmetry and the fact that the total charge is Q. 


We want to know the electric field at some radius r > 
R. We take our Gaussian surface © to be a sphere of radius 
r as shown in the figure. Gauss’ law states 


[ea-¢ 
S E0 


At this point we make use of the spherical symmetry of the 


problem. This tells us that the electric field must point ra- 
dially outwards: E(x) = E(r)r. And, since the integral is 
only over the angular coordinates of the sphere, we can pull Figure 5: 
the function E(r) outside. We have 


[8-48 = E0) f #-a8 = Bem 


S €0 
where the factor of 4rr? has arisen simply because it’s the area of the Gaussian sphere. 
We learn that the electric field outside a spherically symmetric distribution of charge 
Q is 
Q 


= fÍ 
4Teor? 


E(x (2.4) 
That’s nice. This is the familiar result that we’ve seen before. (See, for example, the 
notes on Dynamics and Relativity). The Lorentz force law (1.2) then tells us that a 
test charge q moving in the region r > R experiences a force 
Qa 
r 


Anegr? 


This, of course, is the Coulomb force between two static charged particles. Notice that, 
as promised, 1/e€9 characterises the strength of the force. If the two charges have the 
same sign, so that Qq > 0, the force is repulsive, pushing the test charge away from 
the origin. If the charges have opposite signs, Qq < 0, the force is attractive, pointing 
towards the origin. We see that Gauss’s law (2.1) reproduces this simple result that we 
know about charges. 


Finally, note that the assumption of symmetry was crucial in our above analysis. 
Without it, the electric field E(x) would have depended on the angular coordinates of 
the sphere S and so been stuck inside the integral. In situations without symmetry, 
Gauss’ law alone is not enough to determine the electric field and we need to also use 
V x E = 0. We'll see how to do this in Section 2.2. If you’re worried, however, it’s 
simple to check that our final expression for the electric field (2.4) does indeed solve 
VxE=0. 


= ]0 = 


Coulomb vs Newton 


The inverse-square form of the force is common to both electrostatics and gravity. It’s 
worth comparing the relative strengths of the two forces. For example, we can look 
at the relative strengths of Newtonian attraction and Coulomb repulsion between two 
electrons. These are point particles with mass me and charge —e given by 


em 1.6 x 107! Coulombs and m, 9.1 x107% Kg 


Regardless of the separation, we have 


2 
F Coulomb _ e 1 


FNewton ÅTEo Gm? 


The strength of gravity is determined by Newton’s constant G œ 6.7 x 1071 m3Kg7!s?. 
Plugging in the numbers reveals something extraordinary: 


F Coulomb = 1042 
FNewton 

Gravity is puny. Electromagnetism rules. In fact you knew this already. The mere act 

of lifting up you arm is pitching a few electrical impulses up against the gravitational 


might of the entire Earth. Yet the electrical impulses win. 


However, gravity has a trick up its sleeve. While electric charges come with both 
positive and negative signs, mass is only positive. It means that by the time we get to 
macroscopically large objects — stars, planets, cats — the mass accumulates while the 
charges cancel to good approximation. This compensates the factor of 1074? suppression 
until, at large distance scales, gravity wins after all. 


The fact that the force of gravity is so ridiculously tiny at the level of fundamental 
particles has consequence. It means that we can neglect gravity whenever we talk 
about the very small. (And indeed, we shall neglect gravity for the rest of this course). 
However, it also means that if we would like to understand gravity better on these very 
tiny distances — for example, to develop a quantum theory of gravity — then it’s going 
to be tricky to get much guidance from experiment. 


2.1.2 A Uniform Sphere 


The electric field outside a spherically symmetric charge distribution is always given by 
(2.4). What about inside? This depends on the distribution in question. The simplest 
is a sphere of radius R with uniform charge distribution p. The total charge is 


At 
= — R? 
Q za 


s= 


Let’s pick our Gaussian surface to be a sphere, centered at 
the origin, of radius r < R. The charge contained within 
this sphere is 4apr?/3 = Qr?/R°, so Gauss’ law gives 


Qr 
E . = 
f dS eRe 


Again, using the symmetry argument we can write E(r) = 


E(r)t and compute 


3 ; 
Je -dS = E(r) | ¢- dS = E(r)4ar? = ei Figure 6: 
S S cok 


This tells us that the electric field grows linearly inside the sphere 
Qr 


E(x) ~ 4treg R? i 


r< R (2.5) 
Outside the sphere we revert to the inverse-square 
form (2.4). At the surface of the sphere, r = R, the 
electric field is continuous but the derivative, dE/dr, 


Ẹ 


is not. This is shown in the graph. 
R r 


Figure 7: 
2.1.3 Line Charges 
Consider, next, a charge smeared out along a line which 
we'll take to be the z-axis. We’ll take uniform charge den- 
sity 7 per unit length. (If you like you could consider a 
solid cylinder with uniform charge density and then send 
the radius to zero). We want to know the electric field due 


to this line of charge. 


Our set-up now has cylindrical symmetry. We take the 
Gaussian surface to be a cylinder of length L and radius r. 
We have 


L 
f B= = 
S €0 Figure 8: 


Again, by symmetry, the electric field points in the radial 
direction, away from the line. We’ll denote this vector in cylindrical polar coordinates 
as Î so that E = E(r)f. The symmetry means that the two end caps of the Gaussian 


ay 


surface don’t contribute to the integral because their normal points in the z direction 
and z-r = 0. We’re left only with a contribution from the curved side of the cylinder, 


L 
fF -dS = E(r)2rrL = 12 
S a) 
So that the electric field is 
) 2 
E(r) = . 
(r) Ine. (2 6) 


Note that, while the electric field for a point charge drops off as 1/r? (with r the radial 
distance), the electric field for a line charge drops off more slowly as 1/r. (Of course, the 
radial distance r means slightly different things in the two cases: it isr = \/x? + y? + 2? 
for the point particle, but is r = ,/x? + y? for the line). 


2.1.4 Surface Charges and Discontinuities 


Now consider an infinite plane, which we 
take to be z = 0, carrying uniform charge 
per unit area, 7. We again take our Gaus- 
sian surface to be a cylinder, this time with 
its axis perpendicular to the plane as shown 


in the figure. In this context, the cylin- 
der is sometimes referred to as a Gaussian Figure 9: 
“pillbox” (on account of Gauss’ well known 

fondness for aspirin). On symmetry grounds, we have 


E=E(z)% 


Moreover, the electric field in the upper plane, z > 0, must point in the opposite 
direction from the lower plane, z < 0, so that E(z) = —E(—z). 


The surface integral now vanishes over the curved side of the cylinder and we only 
get contributions from the end caps, which we take to have area A. This gives 


A 
i -dS = E(z)A— E(-z)A = 2E(z)A= E 
5 €0 
The electric field above an infinite plane of charge is therefore 
oj 
E(z) = — 2 
= (2.7) 


Note that the electric field is independent of the distance from the plane! This is 
because the plane is infinite in extent: the further you move from it, the more comes 
into view. 
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Figure 10: The normal component of the Figure 11: The tangential component of 
electric field is discontinuous the electric field is continuous. 


There is another important point to take away from this analysis. The electric field 

is not continuous on either side of a surface of constant charge density. We have 
E(z > 0+) — E(z 307) = Z (2.8) 

0 

For this to hold, it is not important that the plane stretches to infinity. It’s simple to 
redo the above analysis for any arbitrary surface with charge density o. There is no 
need for o to be uniform and, correspondingly, there is no need for E at a given point 
to be parallel to the normal to the surface n. At any point of the surface, we can take 
a Gaussian cylinder, as shown in the left-hand figure above, whose axis is normal to 
the surface at that point. Its cross-sectional area A can be arbitrarily small (since, as 
we saw, it drops out of the final answer). If E+ denotes the electric field on either side 
of the surface, then 


ñ- Ej, — ñ- Ej_= (2.9) 


o 
m 
In contrast, the electric field tangent to the surface is continuous. To see this, we 
need to do a slightly different calculation. Consider, again, an arbitrary surface with 
surface charge. Now we consider a loop C with a length L which lies parallel to the 
surface and a length a which is perpendicular to the surface. We’ve drawn this loop in 
the right-hand figure above, where the surface is now shown side-on. We integrate E 
around the loop. Using Stoke’s theorem, we have 


feeds [Vx Eas 
c 


where S is the surface bounded by C. In the limit a — 0, the surface S' shrinks to zero 
size so this integral gives zero. This means that the contribution to line integral must 
also vanish, leaving us with 


nx E,—-nxE_=0 


This is the statement that the electric field tangential to the surface is continuous. 


= ]4.= 


A Pair of Planes E=0 


As a simple generalisation, consider a pair of infi- Yo = A 
nite planes at z = 0 and z = a, carrying uniform 

surface charge density +o respectively as shown in 

the figure. To compute the electric field we need 

only add the fields for arising from two planes, each 

of which takes the form (2.7). We find that the = 
electric field between the two planes is Figure 12: 


E=} 0<z<a (2.10) 
€0 


while E = 0 outside the planes 


A Plane Slab 


We can rederive the discontinuity (2.9) in the electric field by considering an infinite 
slab of thickness 2d and charge density per unit volume p. When our Gaussian pillbox 
lies inside the slab, with z < d, we have 


22A 
oe Bla = 
€0 €0 


2AE(z) 


Meanwhile, for z > d we get our earlier result (2.7). The electric field is now continuous 
as shown in the figure. Taking the limit d — 0 and p > œ such that the surface charge 
o = pd remains constant reproduces the discontinuity (2.8). 


Figure 13: The Gaussian surface for a Figure 14: The resulting electric field 
plane slab 
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A Spherical Shell 


Let’s give one last example that involves surface charge and t 
the associated discontinuity of the electric field. We’ll con- 
sider a spherical shell of radius R, centered at the origin, 
with uniform surface charge density ø. The total charge is 


Q = 4r R?o 


We already know that outside the shell, r > R, the electric 
field takes the standard inverse-square form (2.4). What 
about inside? Well, since any surface with r < R doesn’t Figure 15: 
surround a charge, Gauss’ law tells us that we necessarily 

have E = 0 inside. That means that there is a discontinuity at the surface r = R, 


Q o 


~ 4r R€9 €Q 


E+ t|,— B-?| 
in accord with the expectation (2.9). 


2.2 The Electrostatic Potential 


For all the examples in the last section, symmetry considerations meant that we only 
needed to consider Gauss’ law. However, for general charge distributions Gauss’ law is 
not sufficient. We also need to invoke the second equation, V x E = 0. 


In fact, this second equation is easily dispatched since V x E = 0 implies that the 
electric field can be written as the gradient of some function, 


E=-V¢ (2.11) 


The scalar ¢ is called the electrostatic potential or scalar potential (or, sometimes, just 
the potential). To proceed, we revert to the original differential form of Gauss’ law 
(2.1). This now takes the form of the Poisson equation 


V-E=2 > V%4=-£ 


0 €0 


(2.12) 


In regions of space where the charge density vanishes, we’re left solving the Laplace 
equation 


V’ =0 (2.13) 


Solutions to the Laplace equation are said to be harmonic functions. 
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A few comments: 


e The potential ¢ is only defined up to the addition of some constant. This seem- 
ingly trivial point is actually the beginning of a long and deep story in theoretical 
physics known as gauge invariance. We’ll come back to it in Section 5.3.1. For 
now, we’ll eliminate this redundancy by requiring that ¢(r) —> 0 as r > on. 


e We know from our study of Newtonian mechanics that the electrostatic potential 
is proportional to the potential energy experienced by a test particle. (See Section 
2.2 of the Dynamics and Relativity lecture notes). Specifically, a test particle of 
mass m, position r(t) and charge q moving in a background electric field has 
conserved energy 


1 
E= zi: t+ gor) 


e The Poisson equation is linear in both @ and p. This means that if we know 
the potential ¢, for some charge distribution pı and the potential @2 for another 
charge distribution p2, then the potential for p1 + p2 is simply ¢, + 2. What this 
really means is that the electric field for a bunch of charges is just the sum of 
the fields generated by each charge. This is called the principle of superposition 
for charges. This linearity of the equations is what makes electromagnetism easy 
compared to other forces of Nature. 


e We stated above that V x E = 0 is equivalent to writing E = —V@. This is true 
when space is R? or, in fact, if we take space to be any open ball in R3. But if our 
background space has a suitably complicated topology then there are solutions to 
V x E = 0 which cannot be written in the form E = —V¢. This is tied ultimately 
to the beautiful mathematical theory of de Rham cohomology. Needless to say, in 
this starter course we’re not going to worry about these issues. We’ll always take 
spacetime to have topology R* and, correspondingly, any spatial hypersurface to 
be R8. 


2.2.1 The Point Charge 
Let’s start by deriving the Coulomb force law yet again. We’ll take a particle of charge 
Q and place it at the origin. This time, however, we'll assume that the particle really is 
a point charge. This means that the charge density takes the form of a delta-function, 
p(x) = Q63(x). We need to solve the equation 

Q 53 


E0 


V?o = —-Ž0 (x) (2.14) 


MES 


You’ve solved problems of this kind in your Methods course. The solution is essentially 
the Green’s function for the Laplacian V°, an interpretation that we’ll return to in 
Section 2.2.3. Let’s recall how we find this solution. We first look away from the 
origin, r # 0, where there’s no funny business going on with delta-function. Here, 
we're looking for the spherically symmetric solution to the Laplace equation. This is 


-2 


r 


for some constant œ. To see why this solves the Laplace equation, we need to use the 
result 


Vr=f (2.15) 


where f is the unit radial vector in spherical polar coordinates, so x = rf. Using the 
chain rule, this means that V(1/r) = —ĉ/r? = —x/r?. This gives us 


V-x ==) 


r3 r5 


s=- i S vg = -a ( 


a 
But V -x = 3 and we find that V7¢ = 0 as required. 


It remains to figure out what to do at the origin where the delta-function lives. 
This is what determines the overall normalization a of the solution. At this point, it’s 
simplest to use the integral form of Gauss’ law to transfer the problem from the origin 
to the far flung reaches of space. To do this, we integrate (2.14) over some region V 
which includes the origin. Integrating the charge density gives 


ox) =Q8%) = [ @er=@ 


So, using Gauss’ law (2.3), we require 


f Vọ -dS = _@ 
S €0 
But this is exactly the kind of surface integral that we were doing in the last section. 


Substituting ¢ = a/r into the above equation, and choosing S to be a sphere of radius 
r, tells us that we must have a = Q/47¢€0, or 


jes (2.16) 
Amer 
Taking the gradient of this using (2.15) gives us Coulomb’s law 
Q 
E = — = 
(x) v9 Atregr? 
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The derivation of Coulomb’s law using the potential was somewhat more involved than 
the technique using Gauss’ law alone that we saw in the last section. However, as we'll 
now see, introducing the potential allows us to write down the solution to essentially 
any problem. 


A Note on Notation 


Throughout these lectures, we will use x and r interchangeably to denote position 
in space. For example, sometimes we’ll write integration over a volume as f d?x and 
sometimes as f d?r. The advantage of the r notation is that it looks more natural when 
working in spherical polar coordinates. For example, we have |r| = r which is nice. 
The disadvantage is that it can lead to confusion when working in other coordinate 
systems, in particular cylindrical polar. For this reason, we’ll alternate between the 
two notations, adopting the attitude that clarity is more important than consistency. 


2.2.2 The Dipole 


A dipole consists of two point charges, Q and —Q, a distance d apart. We place the 
first charge at the origin and the second at r = —d. The potential is simply the sum 
of the potential for each charge, 


-a-r 
4reo \r |r +d 


Similarly, the electric field is just the sum of the electric fields made by the two point 


charges. This follows from the linearity of the equations and is a simple application of 
the principle of superposition that we mentioned earlier. 


It will prove fruitful to ask what the dipole looks like far from the two point charges, 


at a distance r > |d|. We need to Taylor expand the second term above. The vector 
version of the Taylor expansion for a general function f(r) is given by 


fr+d) © f(t) +d: Vf) + 5(d- VAE) +... (2.17) 


Applying this to the function 1/|r + d| gives 


1 1 1 1 E 
x-+d. + —(d- = 

lr + d| r y 5 | N a 
ole dr 1 d-d _ 3(d-r} 
o r r3 2 r3 r5 


— 19 — 


(To derive the last term, it might be easiest to use index notation for d- V = d;0;). 
For our dipole, we’ll only need the first two terms in this expansion. They give the 
potential 


ete ee (2.18) 
We see that the potential for a dipole falls off as 1/r?. Correspondingly, the electric 


field drops off as 1/r?; both are one power higher than the fields for a point charge. 


The electric field is not spherically symmetric. The leading order contribution is 
governed by the combination 


p=Qd 


This is called the electric dipole moment. By convention, it points from the negative 
charge to the positive. The dipole electric field is 


a (2). (2.19) 


Notice that the sign of the electric field depends on where you sit in space. In some 
parts, the force will be attractive; in other parts repulsive. 


It’s sometimes useful to consider the limit d + 0 and Q —> œ such that p = Qd 
remains fixed. In this limit, all the ... terms in (2.18) and (2.19) disappear since they 
contain higher powers of d. Often when people talk about the “dipole”, they implicitly 
mean taking this limit. 


2.2.3 General Charge Distributions 

Our derivation of the potential due to a point charge (2.16), together with the principle 
of superposition, is actually enough to solve — at least formally — the potential due to 
any charge distribution. This is because the solution for a point charge is nothing other 
than the Green’s function for the Laplacian. The Green’s function is defined to be the 
solution to the equation 


WG(r; r’) = P(r — r’) 
which, from our discussion of the point charge, we now know to be 


1 1 
4r |r — r'| 


G(r; r’) = (2.20) 
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We can now apply our usual Green’s function methods to the general Poisson equation 
(2.12). In what follows, we’ll take p(r) # 0 only in some compact region, V, of space. 
The solution to the Poisson equation is given by 

1 3 P(r") 


= — 2.21 
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(To check this, you just have to keep your head and remember whether the operators 
are hitting r or r’. The Laplacian acts on r so, if we compute V7¢, it passes through 
the integral in the above expression and hits G(r;r’), leaving behind a delta-function 
which subsequently kills the integral). 


Similarly, the electric field arising from a general charge distribution is 


1 1 
E(r) = —Vo(r) = E E p(r’) Yeri 
1 r= r 
=. dè 1 WN o Sn 
Ateg Jy PR) lr — r'|? 


Given a very complicated charge distribution p(r), this equation will give back an 
equally complicated electric field E(r). But if we sit a long way from the charge 
distribution, there’s a rather nice simplification that happens. . . 


Long Distance Behaviour 


Suppose now that you want to know what the electric field 
looks like far from the region V. This means that we’re inter- 
ested in the electric field at r with |r| >> |r’| for all r' € V. We 
can apply the same Taylor expansion (2.17), now replacing d 


with —r’ for each r’ in the charged region. This means we can 


write Figure 16: 
1 E 1 , To ty 21l 
ir—r| = r Vit 5 V) Te 
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The leading term is just 


where Q = J, d?r’ p(r’) is the total charge contained within V. So, to leading order, if 
you’re far enough away then you can’t distinguish a general charge distribution from 
a point charge localised at the origin. But if you’re careful with experiments, you can 
tell the difference. The first correction takes the form of a dipole, 


a (2+2F +...) 


Aten \ r r2 


where 
p= | ar’ xo) 
V 


is the dipole moment of the distribution. One particularly important situation is when 
we have a neutral object with Q = 0. In this case, the dipole is the dominant contri- 
bution to the potential. 


We see that an arbitrarily complicated, localised charge distribution can be char- 
acterised by a few simple quantities, of decreasing importance. First comes the total 
charge Q. Next the dipole moment p which contains some basic information about 
how the charges are distributed. But we can keep going. The next correction is called 
the quadrupole and is given by 


where Q;; is a symmetric traceless tensor known as the quadrupole moment, given by 
Qi = f d’r' p(r’) (3rir; — dir” ?) 
V 


It contains some more refined information about how the charges are distributed. After 
this comes the octopole and so on. The general name given to this approach is the mul- 
tipole expansion. It involves expanding the function ¢ in terms of spherical harmonics. 
A systematic treatment can be found, for example, in the book by Jackson. 


A Comment on Infinite Charge Distributions 


In the above, we assumed for simplicity that the charge distribution was restricted to 
some compact region of space, V. The Green’s function approach still works if the 
charge distribution stretches to infinity. However, for such distributions it’s not always 
possible to pick ¢(r) — 0 as r > oo. In fact, we saw an example of this earlier. For an 
infinite line charge of density 7, we computed the electric field in (2.6). It goes as 


Lr) = 1 g 
27 €or 
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where now r? = z? + y? is the cylindrical radial coordinate perpendicular to the line. 
The potential @ which gives rise to this is 


eee L 
a) = 2T€p log (=) 


Because of the log function, we necessarily have ¢(r) — oo as r — oo. Instead, we 
need to pick an arbitrary, but finite distance, rọ at which the potential vanishes. 


2.2.4 Field Lines 


The usual way of depicting a vector is to draw an arrow whose length is proportional 
to the magnitude. For the electric field, there’s a slightly different, more useful way 
to show what’s going on. We draw continuous lines, tangent to the electric field E, 
with the density of lines proportional to the magnitude of E. This innovation, due 
to Faraday, is called the field line. (They are what we have been secretly drawing 
throughout these notes). 


Field lines are continuous. They begin and end only at charges. They can never 
cross. 


The field lines for positive and negative point charges are: 


KA u 
an a 


By convention, the positive charges act as sources for the lines, with the arrows emerg- 
ing. The negative charges act as sinks, with the arrows approaching. 


It’s also easy to draw the equipotentials — surfaces of constant ø — on this same 
figure. These are the surfaces along which you can move a charge without doing any 
work. The relationship E = —V@ ensures that the equipotentials cut the field lines at 
right angles. We usually draw them as dotted lines: 
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Meanwhile, we can (very) roughly sketch the field lines and equipotentials for the dipole 
(on the left) and for a pair of charges of the same sign (on the right): 


2.2.5 Electrostatic Equilibrium 


Here’s a simple question: can you trap an electric charge using only other charges? In 
other words, can you find some arrangements of charges such that a test charge sits in 
stable equilibrium, trapped by the fields of the others. 


There’s a trivial way to do this: just allow a negative charge to sit directly on top of 
a positive charge. But let’s throw out this possibility. We’ll ask that the equilibrium 
point lies away from all the other charges. 


There are some simple set-ups that spring to mind that might achieve this. Maybe 
you could place four positive charges at the vertices of a pyramid; or perhaps 8 positive 
charges at the corners of a cube. Is it possible that a test positive charge trapped in 
the middle will be stable? It’s certainly repelled from all the corners, so it might seem 
plausible. 


The answer, however, is no. There is no electrostatic equilibrium. You cannot trap 
an electric charge using only other stationary electric charges, at least not in a stable 
manner. Since the potential energy of the particle is proportional to ¢, mathematically, 
this is the statement that a harmonic function, obeying V7¢ = 0, can have no minimum 
or maximum. 


To prove that there can be no electrostatic equilibrium, let’s suppose the opposite: 
that there is some point in empty space r, that is stable for a particle of charge q > 0. 
By “empty space”, we mean that p(r) = 0 in a neighbourhood of r,. Because the point 
is stable, if the particle moves away from this point then it must always be pushed 
back. This, in turn, means that the electric field must always point inwards towards 
the point r,; never away. We could then surround r, by a small surface S and compute 


[Basco 
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But, by Gauss’ law, the right-hand side must be the charge contained within S which, 
by assumption, is zero. This is our contradiction: electrostatic equilibrium does not 
exist. 


Of course, if you’re willing to use something other than electrostatic forces then you 
can construct equilibrium situations. For example, if you restrict the test particle to 
lie on a plane then it’s simple to check that equal charges placed at the corners of a 
polygon will result in a stable equilibrium point in the middle. But to do this you need 
to use other forces to keep the particle in the plane in the first place. 


2.3 Electrostatic Energy 


There is energy stored in the electric field. In this section, we calculate how much. 


Let’s start by recalling a fact from our first course on classical mechanics!. Suppose 
we have some test charge q moving in a background electrostatic potential ¢. We’ll 
denote the potential energy of the particle as U(r). (We used the notation V(r) in the 
Dynamics and Relativity course but we’ll need to reserve V for the voltage later). The 
potential U(r) of the particle can be thought of as the work done bringing the particle 
in from infinity; 


ue) =- f F-dr=+4 | Vo- de= alr 


where we’ve assumed our standard normalization of ¢(r) > 0 as r > oo. 


Consider a distribution of charges which, for now, we’ll take to be made of point 
charges q; at positions r;. The electrostatic potential energy stored in this configuration 
is the same as the work required to assemble the configuration in the first place. (This 
is because if you let the charges go, this is how much kinetic energy they will pick up). 
So how much work does it take to assemble a collection of charges? 


Well, the first charge is free. In the absence of any electric field, you can just put it 
where you like — say, rı. The work required is W4 = 0. 


To place the second charge at rə takes work 
age 1 
= Ameo [rı — rol 
Note that if the two charges have the same sign, so qiq2 > 0, then W2 > 0 which is 
telling us that we need to put work in to make them approach. If qıq2 < 0 then W2 < 0 
where the negative work means that the particles wanted to be drawn closer by their 
mutual attraction. 


'See Section 2.2 of the lecture notes on Dynamics and Relativity. 
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The third charge has to battle against the electric field due to both qı and q2. The 


work required is 
W = q3 ( q2 + qı ) 
4reg \lr2 = rss| [r1 — rsl 


and so on. The total work needed to assemble all the charges is the potential energy 


stored in the configuration, 


=) W= = 2.23 
2s T R ( ) 
i=1 i<j 

where }/,- j means that we sum over each pair of particles once. In fact, you probably 


could have just written down (2.23) as the potential energy stored in the configuration. 
The whole purpose of the above argument was really just to nail down a factor of 1/2: 


do we sum over all pairs of particles }°,_, or all particles 5°, P rs The answer, as we 


i<j 
have seen, is all pairs. 


We can make that factor of 1 ° 2 even more explicit by writing 


ij 
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where now we sum over each pair twice. 


There is a slicker way of writing (2.24). The potential at r; due to all the other 
charges qj, j At is 


1 qj 
Ate = lr; — rj] 


o(ri) = 


which means that we can write the potential energy as 


= ; 2, qiolr:) (2.25) 


This is the potential energy for a set of point charges. But there is an obvious general- 
ization to charge distributions p(r). We’ll again assume that p(r) has compact support 
so that the charge is localised in some region of space. The potential energy associated 
to such a charge distribution should be 


U = 5 fe p(r) d(x) (2.26) 


where we can quite happily take the integral over all of R3, safe in the knowledge that 
anywhere that doesn’t contain charge has p(r) = 0 and so won’t contribute. 
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Now this is in a form that we can start to play with. We use Gauss’ law to rewrite 
it as 


U= fer (VE) 6= Ê f ër [V (Bd) -B- Vo) 


But the first term is a total derivative. And since we’re taking the integral over all of 
space and g(r) > 0 as r — on, this term just vanishes. In the second term we can 
replace V@ = —E. We find that the potential energy stored in a charge distribution 
has an elegant expression solely in terms of the electric field that it creates, 


U=2 fdrB-B (2.27) 
Isn’t that nice! 


2.3.1 The Energy of a Point Particle 


There is a subtlety in the above derivation. In fact, I totally tried to pull the wool over 
your eyes. Here it’s time to own up. 


First, let me say that the final result (2.27) is right: this is the energy stored in the 
electric field. But the derivation above was dodgy. One reason to be dissatisfied is 
that we computed the energy in the electric field by equating it to the potential energy 
stored in a charge distribution that creates this electric field. But the end result doesn’t 
depend on the charge distribution. This suggests that there should be a more direct 
way to arrive at (2.27) that only talks about fields and doesn’t need charges. And there 
is. We will see it later. 


But there is also another, more worrying problem with the derivation above. To 
illustrate this, let’s just look at the simplest situation of a point particle. This has 
electric field 

= no (2.28) 
So, by (2.27), the associated electric field should carry energy. But we started our 
derivation above by assuming that a single particle didn’t carry any energy since it 
didn’t take any work to put the particle there in the first place. What’s going on? 


Well, there was something of a sleight of hand in the derivation above. This occurs 
when we went from the expression g@ in (2.25) to p@ in (2.26). The former omits the 
“self-energy” terms; there is no contribution arising from q;@(r;). However, the latter 
includes them. The two expressions are not quite the same. This is also the reason 
that our final expression for the energy (2.27) is manifestly positive, while q@ can be 
positive or negative. 


== 


So which is right? Well, which form of the energy you use rather depends on the 
context. It is true that (2.27) is the correct expression for the energy stored in the 
electric field. But it is also true that you don’t have to do any work to put the first 
charge in place since we’re obviously not fighting against anything. Instead, the “self- 
energy” contribution coming from E- E in (2.28) should simply be thought of — using 


E = mc? — as a contribution to the mass of the particle. 


We can easily compute this contribution for, say, an electron with charge q = —e. 
Let’s call the radius of the electron a. Then the energy stored in its electric field is 


2 © Arr? 2 4 
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We see that, at least as far as the energy is concerned, we’d better not treat the electron 
as a point particle with a — 0 or it will end up having infinite mass. And that will 
make it really hard to move. 


So what is the radius of an electron? For the above calculation to be consistent, the 
energy in the electric field can’t be greater than the observed mass of the electron me. 
In other words, we’d better have 
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That, at least, puts a bound on the radius of the electron, which is the best we can 
do using classical physics alone. To give a more precise statement of the radius of the 
electron, we need to turn to quantum mechanics. 


A Quick Foray into Quantum Electrodynamics 


To assign a meaning of “radius” to seemingly point-like particles, we really need the 
machinery of quantum field theory. In that context, the size of the electron is called its 
Compton wavelength. This is the distance scale at which the electron gets surrounded 
by a swarm of electron-positron pairs which, roughly speaking, smears out the charge 
distribution. This distance scale is 


h 
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We see that the inequality (2.29) translates into an inequality on a bunch of fundamental 
constants. For the whole story to hang together, we require 


e2 
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This is an almost famous combination of constants. It’s more usual to define the 
combination 


e2 
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This is known as the fine structure constant. It is dimensionless and takes the value 


Our discussion above requires a < 2. We see that Nature happily meets this require- 
ment. 


2.3.2 The Force Between Electric Dipoles 


As an application of our formula for electrostatic energy, we can compute the force 
between two, far separated dipoles. We place the first dipole, p4, at the origin. It gives 
rise to a potential 
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Now, at some distance away, we place a second dipole. We’ll take this to consist of a 
charge Q at position r and a charge —Q at position r — d, with d <r. The resulting 
dipole moment is p2 = Qd. We’re not interested in the energy stored in each individual 
dipole; only in the potential energy needed to bring the two dipoles together. This is 
given by (2.23), 


U = Q(t) = ọ(r— d)) ~ Ane, (= f m n 
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where, to get to the second line, we’ve Taylor expanded the denominator of the second 
term. This final expression can be written in terms of the second dipole moment. We 
find the nice, symmetric expression for the potential energy of two dipoles separated 
by distance r, 


1 (= -P2 _ 3(p1: r)(p2: 2) 
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But, we know from our first course on dynamics that the force between two objects is 
just given by F = —VU. We learn that the force between two dipoles is given by 
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The strength of the force, and even its sign, depends on the orientation of the two 
dipoles. If pı and pə lie parallel to each other and to r then the resulting force is 
attractive. If pı and p2 point in opposite directions, and lie parallel to r, then the force 
is repulsive. The expression above allows us to compute the general force. 


2.4 Conductors 


Let’s now throw something new into the mix. A conductor is a region of space which 
contains charges that are free to move. Physically, think “metal”. We want to ask 
what happens to the story of electrostatics in the presence of a conductor. There are 
a number of things that we can say straight away: 


e Inside a conductor we must have E = 0. If this isn’t the case, the charges would 
move. But we’re interested in electrostatic situations where nothing moves. 


e Since E = 0 inside a conductor, the electrostatic potential @ must be constant 
throughout the conductor. 


e Since E = 0 and V- E = p/eo, we must also have p = 0. This means that the 
interior of the conductor can’t carry any charge. 


e Conductors can be neutral, carrying both positive and negative charges which 
balance out. Alternatively, conductors can have net charge. In this case, any net 
charge must reside at the surface of the conductor. 


e Since ¢ is constant, the surface of the conductor must be an equipotential. This 
means that any E = —V¢@ is perpendicular to the surface. This also fits nicely 
with the discussion above since any component of the electric field that lies tan- 
gential to the surface would make the surface charges move. 


e If there is surface charge o anywhere in the conductor then, by our previous 
discontinuity result (2.9), together with the fact that E = 0 inside, the electric 
field just outside the conductor must be 


E =ñ (2.31) 
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Problems involving conductors are of a slightly different nature than those we’ve 
discussed up to now. The reason is that we don’t know from the start where the charges 
are, so we don’t know what charge distribution p that we should be solving for. Instead, 
the electric fields from other sources will cause the charges inside the conductor to shift 
around until they reach equilibrium in such a way that E = 0 inside the conductor. In 
general, this will mean that even neutral conductors end up with some surface charge, 
negative in some areas, positive in others, just enough to generate an electric field inside 
the conductor that precisely cancels that due to external sources. 


An Example: A Conducting Sphere 


To illustrate the kind of problem that we have to deal with, it’s probably best just to 
give an example. Consider a constant background electric field. (It could, for example, 
be generated by two charged plates of the kind we looked at in Section 2.1.4). Now 
place a neutral, spherical conductor inside this field. What happens? 


We know that the conductor can’t suffer an electric field inside it. Instead, the 
mobile charges in the conductor will move: the negative ones to one side; the positive 
ones to the other. The sphere now becomes polarised. These charges counteract the 
background electric field such that E = 0 inside the conductor, while the electric field 
outside impinges on the sphere at right-angles. The end result must look qualitatively 
like this: 
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We'd like to understand how to compute the electric field in this, and related, situations. 
We'll give the answer in Section 2.4.4. 


An Application: Faraday Cage 


Consider some region of space that doesn’t contain any charges, surrounded by a con- 
ductor. The conductor sits at constant @ = ġo while, since there are no charges inside, 
we must have V7¢ = 0. But this means that ¢ = ¢p everywhere. This is because, if it 
didn’t then there would be a maximum or minimum of ¢ somewhere inside. And we 
know from the discussion in Section 2.2.5 that this can’t happen. Therefore, inside a 
region surrounded by a conductor, we must have E = 0. 
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This is a very useful result if you want to shield a region from electric fields. In 
this context, the surrounding conductor is called a Faraday cage. As an application, if 
you’re worried that they’re trying to read your mind with electromagnetic waves, then 
you need only wrap your head in tin foil and all concerns should be alleviated. 


2.4.1 Capacitors 


Let’s now solve for the electric field in some conductor problems. 


The simplest examples are capacitors. These are a pair of con- -< 
ductors, one carrying charge Q, the other charge —Q. — 
+Q | ——> | -Q 

Parallel Plate Capacitor —EEE 
To start, we’ll take the conductors to have flat, parallel surfaces —— 
as shown in the figure. We usually assume that the distance d —>z 
between the surfaces is much smaller than VA, where A is the i 

Figure 17: 


area of the surface. This means that we can neglect the effects 
that arise around the edge of plates and we’re justified in assuming that the electric 
field between the two plates is the same as it would be if the plates were infinite in 
extent. The problem reduces to the same one that we considered in Section 2.1.4. The 
electric field necessarily vanishes inside the conductor while, between the plates we have 
the result (2.10), 
E= 22 
€0 

where o = Q/A and we have assumed the plates are separated in the z-direction. We 
define the capacitance C to be 


Q 
Cry 
where V is the voltage or potential difference which is, as the name suggests, the dif- 
ference in the potential ¢ on the two conductors. Since E = —d@/dz is constant, we 
must have 
Qd 
@=-Ez+c > V=¢(0) - ¢(d) = Ed = — 
Aco 
and the capacitance for parallel plates of area A, separated by distance d, is 
Aco 
as 


Because V was proportional to Q, the charge has dropped out of our expression for the 
capacitance. Instead, C depends only on the geometry of the set-up. This is a general 
property; we will see another example below. 
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Capacitors are usually employed as a method to store electrical energy. We can see 
how much. Using our result (2.27), we have 
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This is the energy stored in a parallel plate capacitor. 


Concentric Sphere Capacitor 


Consider a spherical conductor of radius R,. Around this we 
place another conductor in the shape of a spherical shell with 
inner surface lying at radius Rə. We add charge +Q to the 
sphere and —Q to the shell. From our earlier discussion of 
charged spheres and shells, we know that the electric field be- 
tween the two conductors must be 


E= È G Ri<r< Ro 
Anegr? 


Figure 18: 


Correspondingly, the potential is 
Q 


O = Ri<r< Rə 
Amer 


and the capacitance is given by C = 479 R, Ro/ (Rə — Rj). 


2.4.2 Boundary Value Problems 


Until now, we’ve thought of conductors as carrying some fixed charge Q. These con- 
ductors then sit at some constant potential ¢. If there are other conductors in the 
vicinity that carry a different charge then, as we’ve seen above, there will be some fixed 
potential difference, V = A@ between them. 


However, we can also think of a subtly different scenario. Suppose that we instead 
fix the potential @ in a conductor. This means that, whatever else happens, whatever 
other charges are doing all around, the conductor remains at a fixed ġ. It never deviates 
from this value. 


Now, this sounds a bit strange. We’ve seen above that the electric potential of 
a conductor depends on the distance to other conductors and also on the charge it 
carries. If ọ remains constant, regardless of what objects are around it, then it must 
mean that the charge on the conductor is not fixed. And that’s indeed what happens. 
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Having conductors at fixed @ means that charge can flow in and out of the conductor. 
We implicitly assume that there is some background reservoir of charge which the 
conductor can dip into, taking and giving charge so that @ remains constant. 


We can think of this reservoir of charge as follows: suppose that, somewhere in the 
background, there is a huge conductor with some charge Q which sits at some potential 
ġ. To fix the potential of any other conductor, we simply attach it to one of this big 
reservoir-conductor. In general, some amount of charge will flow between them. The 
big conductor doesn’t miss it, while the small conductor makes use of it to keep itself 
at constant @. 


The simplest example of the situation above arises if you connect your conductor to 
the planet Earth. By convention, this is taken to have ¢ = 0 and it ensures that your 
conductor also sits at o = 0. Such conductors are said to be grounded. In practice, 
one may ground a conductor inside a chip in your cell phone by attaching it the metal 
casing. 


Mathematically, we can consider the following problem. Take some number of ob- 
jects, S;. Some of the objects will be conductors at a fixed value of ¢;. Others will 
carry some fixed charge Q;. This will rearrange itself into a surface charge o; such 
that E = 0 inside while, outside the conductor, E = 47o0n. Our goal is to understand 
the electric field that threads the space between all of these objects. Since there is no 
charge sitting in this space, we need to solve the Laplace equation 


V’ =0 
subject to one of two boundary conditions 
e Dirichlet Boundary Conditions: The value of ¢ is fixed on a given surface S; 


e Neumann Boundary Conditions: The value of V¢ọ - ñ is fixed perpendicular to a 
given surface S; 


Notice that, for each S;, we need to decide which of the two boundary conditions we 
want. We don’t get to chose both of them. We then have the following theorem. 


Theorem: With either Dirichlet or Neumann boundary conditions chosen on each 
surface S;, the Laplace equation has a unique solution. 
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Proof: Suppose that there are two solutions, ¢; and ¢2 with the same specified bound- 
ary conditions. Let’s define f = ¢, — ¢2. We can look at the following expression 


f EIA [a Vf- Vf (2.32) 
V V 


where the V? f term vanishes by the Laplace equation. But, by the divergence theorem, 
we know that 
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However, if we’ve picked Dirichlet boundary conditions then f = 0 on the boundary, 
while Neumann boundary conditions ensure that Vf = 0 on the boundary. This means 
that the integral vanishes and, from (2.32), we must have Vf = 0 throughout space. 
But if we have imposed Dirichlet boundary conditions somewhere, then f = 0 on that 
boundary and so f = 0 everywhere. Alternatively, if we have Neumann boundary 
conditions on all surfaces than Vf = 0 everywhere and the two solutions @; and ¢2 
can differ only by a constant. But, as discussed in Section 2.2, this constant has no 


physical meaning. 


2.4.3 Method of Images 


For particularly simple situations, there is a rather cute method that we can use to 
solve problems involving conductors. Although this technique is somewhat limited, it 
does give us some good intuition for what’s going on. It’s called the method of images. 


A charged particle near a conducting plane 


Consider a conductor which fills all of space x < 0. We’ll ground this conductor so that 
o = 0 for x < 0. Then, at some point x = d > 0, we place a charge q. What happens? 


We’re looking for a solution to the Poisson equation with a delta-function source at 
x = d = (d,0,0), together with the requirement that ¢ = 0 on the plane x = 0. From 
our discussion in the previous section, there’s a unique solution to this kind of problem. 
We just have to find it. 


Here’s the clever trick. Forget that there’s a conductor at x < 0. Instead, suppose 
that there’s a charge —q placed opposite the real charge at x = —d. This is called the 
image charge. The potential for this pair of charges is just the potential 


o 1 q B q 
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Figure 19: A particle near a conducting Figure 20: ...looks like a dipole 
plane... 


By construction, this has the property that @ = 0 for x = 0 and it has the correct 
source at x = (d,0,0). Therefore, this must be the right solution when z > 0. A 
cartoon of this is shown in the figures. Of course, it’s the wrong solution inside the 
conductor where the electric field vanishes. But that’s trivial to fix: we just replace it 
with ¢ = 0 for x < 0. 


With the solution (2.33) in hand, we can now dispense with the image charge and 
explore what’s really going on. We can easily compute the electric field from (2.33). If 
we focus on the electric field in the x direction, it is 

E. = OG _ q (A t5) hela 
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Meanwhile, FE, = 0 for x < 0. The discontinuity of E, at the surface of the conductor 


determines the induced surface charge (2.31). It is 
q d 
2r (d2 + y2 + 22)3/2 


We see that the surface charge is mostly concentrated on the plane at the point closest 
3/2 
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to the real charge. As you move away, it falls off as 1/(y? + 2°) We can compute 


the total induced surface charge by doing a simple integral, 


induced = [eee g == 


The charge induced on the conductor is actually equal to the image charge. This is 
always true when we use the image charge technique. 


Finally, as far as the real charge +q is concerned, as long as it sits at x > 0, it feels 
an electric field which is identical in all respects to the field due to an image charge —q 
embedded in the conductor. This means, in particular, that it will experience a force 

2 
q N 
F=— x 
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This force is attractive, pulling the charge towards the conductor. 
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A charged particle near a conducting sphere 


We can play a similar game for a particle near a grounded, conducting sphere. The 
details are only slightly more complicated. We’ll take the sphere to sit at the origin 
and have radius R. The particle has charge q and sits at x = d = (d,0,0), withd > R. 
Our goal is to place an image charge q’ somewhere inside the sphere so that ¢ = 0 on 
the surface. 


There is a way to derive the answer using conformal transformations. However, 
here we’ll just state it. You should choose a particle of charge q' = —qR/d, placed at 
x = R?/d and, by symmetry, y = z = 0. A cartoon of this is shown in the figure. 
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Figure 21: A particle near a conducting Figure 22: ...looks like a slightly different 


sphere... dipole 
The resulting potential is 
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With a little algebra, you can check that ¢ = 0 whenever z? + y? +2? = R?. With 
a little more algebra, you can easily determine the induced surface charge and check 


that, when integrated over the sphere, we indeed have dinqucea = q’. Once again, our 
charge experiences a force towards the conductor. 


Above we’ve seen how to treat a grounded sphere. But what if we instead have an 
isolated conductor with some fixed charge, Q? It’s easy to adapt the problem above. 
We simply add the necessary excess charge Q — q’ as an image that sits at the origin of 
the sphere. This will induce an electric field which emerges radially from the sphere. 
Because of the principle of superposition, we just add this to the previous electric field 
and see that it doesn’t mess up the fact that the electric field is perpendicular to the 
surface. This is now our solution. 


2.4.4 Many many more problems 


There are many more problems that you can cook up involving conductors, charges and 
electrostatics. Very few of them can be solved by the image charge method. Instead, you 
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need to develop a number of basic tools of mathematical physics. A fairly comprehensive 
treatment of this can be found in the first 100 or so pages of Jackson. 


For now, I would just like to leave you with the solution to the example that kicked 
off this section: what happens if you take a conducting sphere and place it in a constant 
electric field? This problem isn’t quite solved by the image charge method. But it’s 
solved by something similar: an image dipole. 


We’ll work in spherical polar coordinates and chose the original, constant electric 
field to point in the z direction, 


Eo = Eo% = oo = —Eoz = — Eor cos 0 


Take the conducting sphere to have radius R and be centered on the the origin. Let’s 
add to this an image dipole with potential (2.18). We’ll place the dipole at the origin, 
and orient it along the z axis like so: 
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Figure 23: A conducting sphere between Figure 24: ...looks like a dipole between 
charged plates... the plates 


The resulting potential is 


3 
o = — Fo (r- 5) cos 0 
r 


Since we’ve added a dipole term, we can be sure that this still solves the Laplace 
equation outside the conductor. Moreover, by construction, ¢ = 0 when r = R. This 
is all we wanted from our solution. The induced surface charge can again be computed 
by evaluating the electric field just outside the conductor. It is 


o 2R? 
0 = -0 — €o Lo (1 + 5) 


We see that the surface charge is positive in one hemisphere and negative in the other. 


cos 0 = 3c0 E% cos 0 
r=R 


The total induced charge averages to zero. 
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2.4.5 A History of Electrostatics 


Perhaps the simplest demonstration of the attractive properties of electric charge comes 
from rubbing a balloon on your head and sticking it to the wall. This phenomenon was 
known, at least in spirit, to the ancient Greeks and is credited to Thales of Miletus 
around 600 BC. Although, in the absence of any ancient balloons, he had to make do 
with polishing pieces of amber and watching it attract small objects. 


A systematic, scientific approach to electrostatics starts with William Gilbert, physi- 
cist, physician and one-time bursar of St Johns College, Cambridge. (Rumour has it 
that he’d rather have been at Oxford.) His most important work, De Magnete, pub- 
lished in 1600 showed, among other things, that many materials, not just amber, could 
be electrified. With due deference, he referred to these as “electrics”, derived from the 
Greek “nAextpov” (electron) meaning “amber”. These are materials that we now call 
“insulators” . 


There was slow progress over the next 150 years, much of it devoted to building ma- 
chines which could store electricity. A notable breakthrough came from the experiments 
of the little-known English scientist Stephen Grey, who was the first to appreciate that 
the difficulty in electrifying certain objects is because they are conductors, with any 
charge quickly flowing through them and away. Grey spent most of his life as an am- 
ateur astronomer, although his amateur status appears to be in large part because he 
fell foul of Isaac Newton who barred his entry into more professional scientific circles. 
He performed his experiments on conductors in the 1720s, late in life when the lack of 
any income left him destitute and pensioned to Chaterhouse (which was, perhaps, the 
world’s fanciest poorhouse). Upon Newton’s death, the scientific community clamoured 
to make amends. Grey was awarded the Royal Society’s first Copley medal. Then, pre- 
sumably because they felt guilty, he was also awarded the second. Grey’s experiments 
were later reproduced by the French chemist Charles François de Cisternay DuF ay, who 
came to the wonderful conclusion that all objects can be electrified by rubbing apart 
from “metals, liquids and animals”. He does not, to my knowledge, state how much 
rubbing of animals he tried before giving up. He was also the first to notice that static 
electricity can give rise to both attractive and repulsive forces. 


By the 1750s, there were many experiments on electricity, but little theory to explain 
them. Most ideas rested on a fluid description of electricity, but arguments raged over 
whether a single fluid or two fluids were responsible. The idea that there were both 
positive and negative charges, then thought of as a surplus and deficit of fluid, was 
introduced independently by the botanist William Watson and the US founding father 
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Benjamin Franklin. Franklin is arguably the first to suggest that charge is conserved 
although his statement wasn’t quite as concise as the continuity equation: 


It is now discovered and demonstrated, both here and in Europe, that the 
Electrical Fire is a real Element, or Species of Matter, not created by the 
Friction, but collected only. 


Benjamin Franklin, 1747 


Still, it’s nice to know that charge is conserved both in the US and in Europe. 


A quantitative understanding of the theory of electrostatics came only in the 1760s. 
A number of people suggested that the electrostatic force follows an inverse-square 
law, prominent among them Joseph Priestly who is better known for the discovery of 
Oxygen and, of at least equal importance, the invention of soda water. In 1769, the 
Scottish physicist John Robison announced that he had measured the force to fall off as 
1/r?°°. This was before the invention of error bars and he seems to receive little credit. 
Around the same time, the English scientist Henry Cavendish, discover of Hydrogen 
and weigher of the Earth, performed a number of experiments to demonstrate the 
inverse-square law but, as with his many of his other electromagnetic discoveries, he 
chose not to publish. It was left to French physicist Charles Augustin de Coulomb to 
clean up, publishing the results of his definitive experiments in 1785 on the force that 
now carries his name. 


In its final form, Coulomb’s law becomes transmuted into Gauss’ law. For once, this 
was done by the person after whom it’s named. Gauss derived this result in 1835, 
although it wasn’t published until 1867. 
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3. Magnetostatics 


Charges give rise to electric fields. Current give rise to magnetic fields. In this section, 
we will study the magnetic fields induced by steady currents. This means that we are 
again looking for time independent solutions to the Maxwell equations. We will also 
restrict to situations in which the charge density vanishes, so p = 0. We can then set 
E = 0 and focus our attention only on the magnetic field. We’re left with two Maxwell 
equations to solve: 


V x B= umoJ (3.1) 
and 
V:-B=0 (3.2) 


If you fix the current density J, these equations have a unique solution. Our goal in 
this section is to find it. 


Steady Currents 


Before we solve (3.1) and (3.2), let’s pause to think about the kind of currents that we’re 
considering in this section. Because p = 0, there can’t be any net charge. But, of course, 
we still want charge to be moving! This means that we necessarily have both positive 
and negative charges which balance out at all points in space. Nonetheless, these 
charges can move so there is a current even though there is no net charge transport. 


This may sound artificial, but in fact it’s exactly what happens in a typical wire. In 
that case, there is background of positive charge due to the lattice of ions in the metal. 
Meanwhile, the electrons are free to move. But they all move together so that at each 
point we still have p = 0. The continuity equation, which captures the conservation of 
electric charge, is 


Since the charge density is unchanging (and, indeed, vanishing), we have 
Vas =0 


Mathematically, this is just saying that if a current flows into some region of space, an 
equal current must flow out to avoid the build up of charge. Note that this is consistent 
with (3.1) since, for any vector field, V - (V x B) = 0. 
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3.1 Ampeére’s Law 


The first equation of magnetostatics, 
Vx B= umoJ (3:3) 


is known as Ampére’s law. As with many of these vector dif- 
ferential equations, there is an equivalent form in terms of inte- 
grals. In this case, we choose some open surface S with boundary 

C = OS. Integrating (3.3) over the surface, we can use Stokes’ 3 
theorem to turn the integral of V x B into a line integral over 
the boundary C, 


[Vv xB aS = $ B-dr=po f 3-08 Figure 25: 
S c S 


Recall that there’s an implicit orientation in these equations. The surface S comes 


with a normal vector n which points away from S in one direction. The line integral 
around the boundary is then done in the right-handed sense, meaning that if you stick 
the thumb of your right hand in the direction n then your fingers curl in the direction 
of the line integral. 


The integral of the current density over the surface S is the same thing as the total 
current J that passes through S. Ampére’s law in integral form then reads 


$ B - dr = ml (3.4) 
C 


For most examples, this isn’t sufficient to determine the form of the magnetic field; 
we'll usually need to invoke (3.2) as well. However, there is one simple example where 
symmetry considerations mean that (3.4) is all we need... 


3.1.1 A Long Straight Wire 


Consider an infinite, straight wire carrying current J. We’ll take it to point in the z 
direction. The symmetry of the problem is jumping up and down telling us that we 
need to use cylindrical polar coordinates, (r,y,z), where r = ,/x?+y? is the radial 
distance away from the wire. 


We take the open surface S to lie in the x — y plane, centered on the wire. For the 
line integral in (3.4) to give something that doesn’t vanish, it’s clear that the magnetic 
field has to have some component that lies along the circumference of the disc. 
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But, by the symmetry of the problem, that’s actually the 
only component that B can have: it must be of the form 
B = Bir)p. (If this was a bit too quick, we’ll derive 
this more carefully below). Any magnetic field of this 
form automatically satisfies the second Maxwell equation 
V-B = 0. We need only worry about Ampére’s law which 
tells us 
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Figure 26: 


We see that the strength of the magnetic field is 
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The magnetic field circles the wire using the right-hand rule”: stick the thumb of your 
right hand in the direction of the current and your fingers curl in the direction of the 
magnetic field. 


Note that the simplest example of a magnetic field falls off as 1/r. In contrast, the 
simplest example of an electric field — the point charge — falls of as 1/r?. You can trace 
this difference back to the geometry of the two situations. Because magnetic fields 
are sourced by currents, the simplest example is a straight line and the 1/r fall-off is 
because there are two transverse directions to the wire. Indeed, we saw in Section 2.1.3 
that when we look at a line of charge, the electric field also drops off as 1/r. 


3.1.2 Surface Currents and Discontinuities 


Consider the flat plane lying at z = 0 with a surface current density that we’ll call K. 
Note that K is the current per unit length, as opposed to J which is the current per 
unit area. You can think of the surface current as a bunch of wires, all lying parallel 
to each other. 


We’ll take the current to lie in the x-direction: K = Kx as shown below. 
> 
y 
/ x K 


From our previous result, we know that the B field should curl around the current in 


the right-handed sense. But, with an infinite number of wires, this can only mean that 
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B is oriented along the y direction. In fact, from the symmetry of the problem, it must 
look like 


with B pointing in the —y direction when z > 0 and in the +y direction when z < 0. 
We write 


B = —B(z)y 


with B(z) = —B(—z). We invoke Ampère’s law using the following open surface: 


with length L in the y direction and extending to +z. We have 
$ B -dr = LB(z) — LB(-z) = 2LB(z) = KL 
c 


so we find that the magnetic field is constant above an infinite plane of surface current 


_ Holt 


Ba) = 


z>0 


This is rather similar to the case of the electric field in the presence of an infinite plane 
of surface charge. 


The analogy with electrostatics continues. The magnetic field is not continuous 
across a plane of surface current. We have 


B(z => 0*) — B(z 30°) = mK 


In fact, this is a general result that holds for any surface current K. We can prove this 
statement by using the same curve that we used in the Figure above and shrinking it 


= 44 = 


until it barely touches the surface on both sides. If the normal to the surface is n and 
B+ denotes the magnetic field on either side of the surface, then 


nx B),— nx B|_ = mK (3.6) 


Meanwhile, the magnetic field normal to the surface is continuous. (To see this, you 
can use a Gaussian pillbox, together with the other Maxwell equation V - B = 0). 


When we looked at electric fields, we saw that the normal component was discontinu- 
ous in the presence of surface charge (2.9) while the tangential component is continuous. 
For magnetic fields, it’s the other way around: the tangential component is discontin- 
uous in the presence of surface currents. 


A Solenoid 


A solenoid consists of a surface current that travels around a cylin- B 
der. It’s simplest to think of a single current-carrying wire winding 
many times around the outside of the cylinder. (Strictly speaking, 


the cross-sectional shape of the solenoid doesn’t have to be a circle — — 

it can be anything. But we’ll stick with a circle here for simplicity). 

To make life easy, we’ll assume that the cylinder is infinitely long. Ca 
This just means that we can neglect effects due to the ends. 


We’ll again use cylindrical polar coordinates, (r, y, z), with the 
axis of the cylinder along z. By symmetry, we know that B will Figure 27: 
point along the z-axis. Its magnitude can depend only on the radial 
distance: B = B(r)z. Once again, any magnetic field of this form immediately satisfies 
V-B=0. 


We solve Ampére’s law in differential form. Anywhere other than —>—_ 
the surface of the solenoid, we have J = 0 and 

dB 
a 0 = B(r)= constant IE 


Outside the solenoid, we must have B(r) = 0 since B(r) is constant 


VxB=0 > 


and we know B(r) + 0 as r > oo. To figure out the magnetic field 


inside the solenoid, we turn to the integral form of Ampére’s law 


a 
and consider the surface S, bounded by the curve C shown in the 
< 
figure. Only the line that runs inside the solenoid contributes to 
the line integral. We have Figure 28: 


Bde = BL=poINL 
C 
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where N is the number of windings of wire per unit length. We learn that inside the 
solenoid, the constant magnetic field is given by 


B = oI N2 (3.7) 


Note that, since K = IN, this is consistent with our general formula for the disconti- 
nuity of the magnetic field in the presence of surface currents (3.6). 


3.2 The Vector Potential 


For the simple current distributions of the last section, symmetry considerations were 
enough to lead us to a magnetic field which automatically satisfied 


V-B=0 (3.8) 


But, for more general currents, this won’t be the case. Instead we have to ensure that 
the second magnetostatic Maxwell equation is also satisfied. 


In fact, this is simple to do. We are guaranteed a solution to V - B = 0 if we write 
the magnetic field as the curl of some vector field, 


B=VxA (3.9) 


Here A is called the vector potential. While magnetic fields that can be written in the 
form (3.9) certainly satisfy V - B = 0, the converse is also true; any divergence-free 
magnetic field can be written as (3.9) for some A. 


(Actually, this previous sentence is only true if our space has a suitably simple 
topology. Since we nearly always think of space as R? or some open ball on R?, 
we rarely run into subtleties. But if space becomes more interesting then the possible 
solutions to V -B = 0 also become more interesting. This is analogous to the story of 
the electrostatic potential that we mentioned briefly in Section 2.2). 


Using the expression (3.9), Ampère’s law becomes 
VxB=-V?A+V(V-A) = pod (3.10) 


where, in the first equality, we’ve used a standard identity from Vector Calculus. This 
is the equation that we have to solve to determine A and, through that, B. 
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3.2.1 Magnetic Monopoles 


Above, we dispatched with the Maxwell equation V - B = 0 fairly quickly by writing 
B = V x A. But we never paused to think about what this equation is actually telling 
us. In fact, it has a very simple interpretation: it says that there are no magnetic 
charges. A point-like magnetic charge g would source the magnetic field, giving rise a 
1/r? fall-off 


_ î 
Agr? 


An object with this behaviour is usually called a magnetic monopole. Maxwell’s equa- 
tions says that they don’t exist. And we have never found one in Nature. 


However, we could ask: how robust is this conclusion? Are we sure that magnetic 
monopoles don’t exist? After all, it’s easy to adapt Maxwell’s equations to allow for 
presence of magnetic charges: we simply need to change (3.8) to read V-B = pm where 
Pm is the magnetic charge distribution. Of course, this means that we no longer get to 
use the vector potential A. But is that such a big deal? 


The twist comes when we turn to quantum mechanics. Because in quantum mechan- 
ics we’re obliged to use the vector potential A. Not only is the whole framework of 
electromagnetism in quantum mechanics based on writing things using A, but it turns 
out that there are experiments that actually detect certain properties of A that are lost 
when we compute B = V x A. I won’t explain the details here, but if you’re interested 
then look up the “Aharonov-Bohm effect” in the lectures on Solid State Physics. 


Monopoles After All? 


To summarise, magnetic monopoles have never been observed. We have a law of physics 
(3.8) which says that they don’t exist. And when we turn to quantum mechanics we 
need to use the vector potential A which automatically means that (3.8) is true. It 
sounds like we should pretty much forget about magnetic monopoles, right? 


Well, no. There are actually very good reasons to suspect that magnetic monopoles 
do exist. The most important part of the story is due to Dirac. He gave a beautiful 
argument which showed that it is in fact possible to introduce a vector potential A 
which allows for the presence of magnetic charge, but only if the magnetic charge g is 
related to the charge of the electron e by 


ge=2rin neEZ (3.11) 


This is known as the Dirac quantization condition. 


wA = 


Moreover, following work in the 1970s by ’t Hooft and Polyakov, we now realise that 
magnetic monopoles are ubiquitous in theories of particle physics. Our best current 
theory — the Standard Model — does not predict magnetic monopoles. But every theory 
that tries to go beyond the Standard Model, whether Grand Unified Theories, or String 
Theory or whatever, always ends up predicting that magnetic monopoles should exist. 
They’re one of the few predictions for new physics that nearly all theories agree upon. 


These days most theoretical physicists think that magnetic monopoles probably exist 
and there have been a number of experiments around the world designed to detect them. 
However, while theoretically monopoles seem like a good bet, their future observational 
status is far from certain. We don’t know how heavy magnetic monopoles will be, but 
all evidence suggests that producing monopoles is beyond the capabilities of our current 
(or, indeed, future) particle accelerators. Our only hope is to discover some that Nature 
made for us, presumably when the Universe was much younger. Unfortunately, here 
too things seem against us. Our best theories of cosmology, in particular inflation, 
suggest that any monopoles that were created back in the Big Bang have long ago been 
diluted. At a guess, there are probably only a few floating around our entire observable 
Universe. The chances of one falling into our laps seem slim. But I hope I’m wrong. 


3.2.2 Gauge Transformations 


The choice of A in (3.9) is far from unique: there are lots of different vector potentials 
A that all give rise to the same magnetic field B. This is because the curl of a gradient 
is automatically zero. This means that we can always add any vector potential of the 
form Vx for some function x and the magnetic field remains the same, 


A'=A+VxX > VXxXA'=VXA 


Such a change of A is called a gauge transformation. As we will see in Section 5.3.1, it is 
closely tied to the possible shifts of the electrostatic potential ġ. Ultimately, such gauge 
transformations play a key role in theoretical physics. But, for now, we’re simply going 
to use this to our advantage. Because, by picking a cunning choice of x, it’s possible 
to simplify our quest for the magnetic field. 


Claim: We can always find a gauge transformation y such that A’ satisfies V- A’ = 0. 
Making this choice is usually referred to as Coulomb gauge. 


Proof: Suppose that we’ve found some A which gives us the magnetic field that 
we want, so V x A = B, but when we take the divergence we get some function 
V-A=1(x). We instead choose A’ = A + Vy which now has divergence 


V-A'=V-A+V°x=Y++V’x 
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So if we want V - A’ = 0, we just have to pick our gauge transformation y to obey 
V’x = -4 


But this is just the Poisson equation again. And we know from our discussion in Section 
2 that there is always a solution. (For example, we can write it down in integral form 


using the Green’s function). 


Something a Little Misleading: The Magnetic Scalar Potential 


There is another quantity that is sometimes used called the magnetic scalar potential, 
Q. The idea behind this potential is that you might be interested in computing the 
magnetic field in a region where there are no currents and the electric field is not 
changing with time. In this case, you need to solve V x B = 0, which you can do by 
writing 


B= —-VQ 


Now calculations involving the magnetic field really do look identical to those involving 
the electric field. 


However, you should be wary of writing the magnetic field in this way. As we'll 
see in more detail in Section 5.3.1, we can always solve two of Maxwell’s equations by 
writing E and B in terms of the electric potential @ and vector potential A and this 
formulation becomes important as we move onto more advanced areas of physics. In 
contrast, writing B = — VQ is only useful in a limited number of situations. The reason 
for this really gets to the heart of the difference between electric and magnetic fields: 
electric charges exist; magnetic charges don’t! 


3.2.3 Biot-Savart Law 


We’re now going to use the vector potential to solve for the magnetic field B in the 
presence of a general current distribution. From now, we’ll always assume that we’re 
working in Coulomb gauge and our vector potential obeys V - A = 0. Then Ampére’s 
law (3.10) becomes a whole lot easier: we just have to solve 


V7A = -uoJ (3.12) 


But this is just something that we’ve seen already. To see why, it’s perhaps best to 
write it out in Cartesian coordinates. This then becomes three equations, 


VA; = -uoi (i= 1,2,3) (3.13) 


and each of these is the Poisson equation. 
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It’s worth giving a word of warning at this point: the expression V?A is simple in 
Cartesian coordinates where, as we’ve seen above, it reduces to the Laplacian on each 
component. But, in other coordinate systems, this is no longer true. The Laplacian 
now also acts on the basis vectors such as r and ~. So in these other coordinate 
systems, VA is a little more of a mess. (You should probably use the identity V? A = 
-V x (V x A) + V(V - A) if you really want to compute in these other coordinate 
systems). 


Anyway, if we stick to Cartesian coordinates then everything is simple. In fact, 
the resulting equations (3.13) are of exactly the same form that we had to solve in 
electrostatics. And, in analogy to (2.21), we know how to write down the most general 
solution using Green’s functions. It is 


aie) = f ate 2 


An |x — x’| 


Or, if you’re feeling bold, you can revert back to vector notation and write 


A(x) = 2 I pra (3.14) 


An |x — x’| 


where you’ve just got to remember that the vector index on A links up with that on J 
(and not on x or x’). 
Checking Coulomb Gauge 


We've derived a solution to (3.12), but this is only a solution to Ampére’s equation 
(3.10) if the resulting A obeys the Coulomb gauge condition, V- A = 0. Let’s now 
check that it does. We have 


VAs) = ferv (Zee) 


where you need to remember that the index of V is dotted with the index of J, but the 
derivative in V is acting on x, not on x’. We can write 


V- A(x) = E | r I(x’). V (——) 


4r Jy |x —x 


1 
= a Ër J(x) -Vv (a) 


T Jy x— x] 


Here we’ve done something clever. Now our V’ is differentiating with respect to x’. To 
get this, we’ve used the fact that if you differentiate 1/|x — x’| with respect to x then 
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you get the negative of the result from differentiating with respect to x’. But since V’ 
sits inside an f d'7’ integral, it’s ripe for integrating by parts. This gives 


Y. A(x) = i P lv. ( | -— V' - I(x’) (a) 


v x-x |x — x’| 


The second term vanishes because we’re dealing with steady currents obeying V -J = 0. 
The first term also vanishes if we take the current to be localised in some region of space, 
V CV so that J(x) = 0 on the boundary OV. We’ll assume that this is the case. We 
conclude that 


V:-A=0 


and (3.14) is indeed the general solution to the Maxwell equations (3.1) and (3.2) as 
we'd hoped. 
The Magnetic Field 


From the solution (3.14), it is simple to compute the magnetic field B= V x A. Again, 
we need to remember that the V acts on the x in (3.14) rather than the x’. We find 


Boy = 2 f ee XO (3.15) 


An |x — x|? 


This is known as the Biot-Savart law. It describes the magnetic field due to a general 
current density. 


There is a slight variation on (3.15) which more often goes by the name of the Biot- 
Savart law. This arises if the current is restricted to a thin wire which traces out a 
curve C. Then, for a current density J passing through a small volume ôV, we write 
JOV = (JA)dx where A is the cross-sectional area of the wire and 6z lies tangent to 
C. Assuming that the cross-sectional area is constant throughout the wire, the current 
I = JA is also constant. The Biot-Savart law becomes 


a | EA (3.16) 


An |x — x’|8 


This describes the magnetic field due to the current J in the wire. 


An Example: The Straight Wire Revisited 


Of course, we already derived the answer for a straight wire in (3.5) without using this 
fancy vector potential technology. Before proceeding, we should quickly check that the 
Biot-Savart law reproduces our earlier result. As before, we’ll work in cylindrical polar 
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coordinates. We take the wire to point along the z axis and use 
r? = r? + y? as our radial coordinate. This means that the line 
element along the wire is parametrised by dx’ = zdz and, for a point 
x away from the wire, the vector dx’ x (x—x’) points along the tangent 


to the circle of radius r, 


So we have 


Figure 29: 


weg j Hol 
B= = 
i i (r2 + z2)3/2 ime" 


which is the same result we found earlier (3.5). 


3.2.4 A Mathematical Diversion: The Linking Number 


There’s a rather cute application of these ideas to pure mathematics. Consider two 
closed, non-intersecting curves, C and C’, in R3. For each pair of curves, there is an 
integer n € Z called the linking number which tells you how many times one of the 
curves winds around the other. For example, here are pairs of curves with linking 
number |n| = 0, 1 and 2. 


OO W C2 


Figure 30: Curves with linking number n = 0, n = 1 and n = 2. 


To determine the sign of the linking number, we need to specify the orientation of each 
curve. In the last two figures above, the linking numbers are negative, if we traverse 
both red and blue curves in the same direction. The linking numbers are positive if we 
traverse one curve in a clockwise direction, and the other in an anti-clockwise direction. 


Importantly, the linking number doesn’t change as you deform either curve, provided 


that the two curves never cross. In fancy language, the linking number is an example 
of a topological invariant. 
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There is an integral expression for the linking number, first written down by Gauss 
during his exploration of electromagnetism. The Biot-Savart formula (3.16) offers a 
simple physics derivation of Gauss’ expression. Suppose that the curve C carries a 
current I. This sets us a magnetic field everywhere in space. We will then compute 
fo B-dx’ around another curve C. (If you want a justification for computing $o B-dx' 
then you can think of it as the work done when transporting a magnetic monopole of 
unit charge around C, but this interpretation isn’t necessary for what follows.) The 
Biot-Savart formula gives 


f BE) dx g =f ax § E (x — x) 
1 1 |x — x? 


where we’ve changed our conventions somewhat from (3.16): now x labels coordinates 
on C while x’ labels coordinates on C”. 


Meanwhile, we can also use Stokes’ theorem, followed by Ampére’s law, to write 


f Bœ) -dx = | (Vx B)-d8=p0 | 3-48 


where S” is a surface bounded by C”. The current is carried by the other curve, C, 
which pierces S’ precisely n times, so that 


$ B(x’) -ax = po | J -dS = n uol 
7 Ss! 


Comparing the two equations above, we arrive at Gauss’ double-line integral expression 


dx x (x — x) 
: al 
~ At = ff x £ ix — xB oo 


Note that our final expression is symmetric in C and C”, even though these two curves 


for the linking number n, 


played a rather different physical role in the original definition, with C carrying a 
current, and C” the path traced by some hypothetical monopole. To see that the 
expression is indeed symmetric, note that the triple product can be thought of as the 
determinant det(x’,x,x’ — x). Swapping x and x’ changes the order of the first two 
vectors and changes the sign of the third, leaving the determinant unaffected. 


The formula (3.17) is rather pretty. It’s not at all obvious that the right-hand-side 
doesn’t change under (non-crossing) deformations of C and C”; nor is it obvious that 
the right-hand-side must give an integer. Yet both are true, as the derivation above 
shows. This is the first time that ideas of topology sneak into physics. It’s not the last. 
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3.3 Magnetic Dipoles 


We’ve seen that the Maxwell equations forbid magnetic monopoles with a long-range 
B ~ 1/r? fall-off (3.11). So what is the generic fall-off for some distribution of currents 
which are localised in a region of space? In this section we will see that, if you’re 
standing suitably far from the currents, you'll typically observe a dipole-like magnetic 
field. 


3.3.1 A Current Loop 


We start with a specific, simple example. Consider 


a circular loop of wire C of radius R carrying a 
current J. We can guess what the magnetic field 

looks like simply by patching together our result 

for straight wires: it must roughly take the shape 

shown in the figure However, we can be more ac- Y 
curate. Here we restrict ourselves only to the mag- 

netic field far from the loop. Figure si: 


To compute the magnetic field far away, we won’t 
start with the Biot-Savart law but instead return to the original expression for A given 
n (3.14). We’re going to return to the notation in which a point in space is labelled as 
r rather than x. (This is more appropriate for long-distance distance fields which are 
essentially an expansion in r = |r|). The vector potential is then given by 


TET 


At Ir-r] 


Writing this in terms of the current J (rather than the current density J), we have 


jee 
|r = r'| 


We want to ask what this looks like far from the loop. Just as we did for the electrostatic 


potential, we can Taylor expand the integrand using (2.22), 


1 o1 r-r 
r—r| r r3 


So that 


ap’ 
A(r) = roa dr’ G ie = + = (3.18) 
C r T 
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The first term in this expansion vanishes because we’re integrating around a circle. 
This is just a reflection of the fact that there are no magnetic monopoles. For the 
second term, there’s a way to write it in slightly more manageable form. To see this, 
let’s introduce an arbitrary constant vector g and use this to look at 


fag (er) 


Recall that, from the point of view of this integral, both g and r are constant vectors; 
it’s the vector r’ that we’re integrating over. This is now the kind of line integral of a 
vector that allows us to use Stokes’ theorem. We have 


$ dr'-g(r-r)= fas -V x (g(r-r))= | AS; Eijk; (gerri) 
c s s 


where, in the final equality, we’ve resorted to index notation to help us remember what’s 
connected to what. Now the derivative 0’ acts only on the r’ and we get 


f gær) = f dSn =g: f dS xr 
c S S 


But this is true for all constant vectors g which means that it must also hold as a vector 
identity once we strip away g. We have 


f de (rr) =S xr 
c 


where we’ve introduced the vector area S of the surface S bounded by C, defined as 


s= Jas 
S 


If the boundary Č lies in a plane — as it does for us — then the vector S points out of 
the plane. 


Now let’s apply this result to our vector potential (3.18). With the integral over r’, 
we can treat r as the constant vector g that we introduced in the lemma. With the 
first term vanishing, we’re left with 


io MX 
4r 3 


A(r) (3.19) 


where we’ve introduced the magnetic dipole moment 


m= IS 
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This is our final, simple, answer for the long-range behaviour of the vector potential 
due to a current loop. It remains only to compute the magnetic field. A little algebra 
gives 


B(r) = T Ge dias =) (3.20) 


Now we see why m is called the magnetic dipole; this form of the magnetic field is 
exactly the same as the dipole electric field (2.19). 


I stress that the B field due to a current loop and E field due to two charges don’t 
look the same close up. But they have identical “dipole” long-range fall-offs. 
3.3.2 General Current Distributions 


We can now perform the same kind of expansion for a general current distribution J 
localised within some region of space. We use the Taylor expansion (2.22) in the general 
form of the vector potential (3.14), 


A;(r) = e fa l e fa (5 pE a (3.21) 


Ar lr—r’| 4r 


where we’re using a combination of vector and index notation to help remember how 
the indices on the left and right-hand sides match up. 


The first term above vanishes. Heuristically, this is because currents can’t stop and 
end, they have to go around in loops. This means that the contribution from one part 
must be cancelled by the current somewhere else. To see this mathematically, we use 
the slightly odd identity 


Oj(Jjri) = (Oj 55) ri + Ji = Si (3.22) 


where the last equality follows from the continuity condition V -J = 0. Using this, 
we see that the first term in (3.21) is a total derivative (of 0/Or; rather than 0/0r;) 
which vanishes if we take the integral over R and keep the current localised within 
some interior region. 


For the second term in (3.21) we use a similar trick, now with the identity 
OS Ge) = (jJi )rirk + Jir k + Thi = Jirk + Jiri 


Because J in (3.21) is a function of r’, we actually need to apply this trick to the 
Jir; terms in the expression. We once again abandon the boundary term to infinity. 
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Dropping the argument of J, we can use the identity above to write the relevant piece 
of the second term as 


1 
per Jirjr; = fer Aeh — Jir) = fe : 5 (J; (r-r’) —ri(J -r)) 


But now this is in a form that is ripe for the vector product identity a x (b x c) = 
b(a-c) —c(a-b). This means that we can rewrite this term as 


1 
per J@-r)= ar x fer Jxr (3.23) 
With this in hand, we see that the long distance fall-off of any current distribution 


again takes the dipole form (3.19) 


fo mxr 
4r r 


A(r) 


now with the magnetic dipole moment given by the integral, 
1 Bl l 1 
m=5 drr <r) (3.24) 


Just as in the electric case, the multipole expansion continues to higher terms. This 
time you need to use vector spherical harmonics. Just as in the electric case, if you 
want further details then look in Jackson. 


3.4 Magnetic Forces 


We’ve seen that a current produces a magnetic field. But a current is simply moving 
charge. And we know from the Lorentz force law that a charge q moving with velocity 
v will experience a force 


F=qvxB 


This means that if a second current is placed somewhere in the neighbourhood of the 
first, then they will exert a force on one another. Our goal in this section is to figure 
out this force. 


3.4.1 Force Between Currents 


Let’s start simple. Take two parallel wires carrying currents J; and I, respectively. 
We’ll place them a distance d apart in the x direction. 


se 


The current in the first wire sets up a magnetic field 


: : : : I 
(3.5). So if the charges in the second wire are moving with i . 
velocity v, they will each experience a force 
Bı 
Holi \ a 
F = qv x B = qv x | — 
q q (£ 9 y 
Z 
where y is the direction of the magnetic field experienced t” d 
by the second wire as shown in the Figure. The next step i 
is to write the velocity v in terms of the current I, in Figure 32: 


the second wire. We did this in Section 1.1 when we first 
introduced the idea of currents: if there’s a density n of these particles and each carries 
charge q, then the current density is 


J2 = nqv 


For a wire with cross-sectional area A, the total current is just I2 = JoA. For our 
set-up, Jo = Joz. 


Finally, we want to compute the force on the wire per unit length, f. Since the 
number of charges per unit length is nA and F is the force on each charge, we have 


O o kold R ae Lolilo A 
f = nAF = (ane ) ZxXy= (aan X (3.25) 


This is our answer for the force between two parallel wires. If the two currents are 
in the same direction, so that Jj > 0, the overall minus sign means that the force 
between two wires is attractive. For currents in opposite directions, with [Ip < 0, the 
force is repulsive. 


The General Force Between Currents 


We can extend our discussion to the force experienced between two current distributions 
J, and Jz. We start by considering the magnetic field B(r) due to the first current J4. 
As we’ve seen, the Biot-Savart law (3.15) tells us that this can be written as 


B(r) = £ fer Tess) 


«An jr — r'|’ 


If the current Jı is localised on a curve C4, then we can replace this volume integral 
with the line integral (3.16) 


B(r) = Moly $ dr, x (r = rı) 
C1 


An jr — rl’ 
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Now we place a second current distribution Jy in this magnetic field. It experiences a 
force per unit area given by (1.3), so the total force is 


F = per Jo(r) x Bir) (3.26) 


Again, if the current J» is restricted to lie on a curve Co, then this volume integral can 
be replaced by the line integral 


F=f dr x B(r) 
C2 


and the force can now be expressed as a double line integral, 


F= torr g $ dry x (an x a) 
4r Cı J C2 [r2 = r| 


In general, this integral will be quite tricky to perform. However, if the currents are 
localised, and well-separated, there is a somewhat better approach where the force can 
be expressed purely in terms of the dipole moment of the current. 


3.4.2 Force and Energy for a Dipole 


We start by asking a slightly different question. We’ll forget about the second current 
and just focus on the first: call it J(r). We’ll place this current distribution in a 
magnetic field B(r) and ask: what force does it feel? 


In general, there will be two kinds of forces. There will be a force on the centre of 
mass of the current distribution, which will make it move. There will also be a torque 
on the current distribution, which will want to make it re-orient itself with respect to 
the magnetic field. Here we’re going to focus on the former. Rather remarkably, we’ll 
see that we get the answer to the latter for free! 


The Lorentz force experienced by the current distribution is 


re f dr I(r) x B(r) 
v 
We’re going to assume that the current is localised in some small region r = R and 


that the magnetic field B varies only slowly in this region. This allows us to Taylor 
expand 
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We then get the expression for the force 
F = —B(R) x f dr J(r) +f d’r J(r) x [(r- V)B(R)| +... 
V V 


The first term vanishes because the currents have to go around in loops; we’ve already 
seen a proof of this following equation (3.21). We’re going to do some fiddly manipula- 
tions with the second term. To help us remember that the derivative V is acting on B, 
which is then evaluated at R, we’ll introduce a dummy variable r’ and write the force 
as 


F= / dr J(r) x [(r- V)B(r’)| (3.27) 
v 


r'=R 
Now we want to play around with this. First, using the fact that V x B = 0 in the 
vicinity of the second current, we’re going to show, that we can rewrite the integrand 
as 


J(r) x [@- VB) = -V' x [r : Br’) J@)] 


To see why this is true, it’s simplest to rewrite it in index notation. After shuffling a 
couple of indices, what we want to show is: 


€ijnd (7) ri ODT) = Cie gh) ri O;,Bi(r’) 
Or, subtracting one from the other, 
eigkdj(r) ri (By (r’) — O,,Bi(r')) = 0 


But the terms in the brackets are the components of V x B and so vanish. So our 
result is true and we can rewrite the force (3.27) as 


F=-V x [er (r Bir) ) Jr) 


r/=R 
Now we need to manipulate this a little more. We make use of the identity (3.23) where 


we replace the constant vector by B. Thus, up to some relabelling, (3.23) is the same 
as 


1 
J dr BIB | drIxr=-Bxm 
v 2 V 


where m is the magnetic dipole moment of the current distribution. Suddenly, our 
expression for the force is looking much nicer: it reads 


F = V x (B x m) 


— 60 — 


where we’ve dropped the r’ = R notation because, having lost the integral, there’s no 
cause for confusion: the magnetic dipole m is a constant, while B varies in space. Now 
we invoke a standard vector product identity. Using V-B = 0, this simplifies and we’re 
left with a simple expression for the force on a dipole 


F=V(B-m) (3.28) 


After all that work, we’re left with something remarkably simple. Moreover, like many 
forces in Newtonian mechanics, it can be written as the gradient of a function. This 
function, of course, is the energy U of the dipole in the magnetic field, 


U=-B-m (3.29) 


This is an important expression that will play a role in later courses in Quantum 
Mechanics and Statistical Physics. For now, we’ll just highlight something clever: we 
derived (3.29) by considering the force on the centre of mass of the current. This is 
related to how U depends on r. But our final expression also tells us how the energy 
depends on the orientation of the dipole m at fixed position. This is related to the 
torque. Computing the force gives us the torque for free. This is because, ultimately, 
both quantities are derived from the underlying energy. 


The Force Between Dipoles 


As a particular example of the force (3.28), consider the case where the magnetic field 
is set up by a dipole mı. We know that the resulting long-distance magnetic field is 
(3.24), 


B(r) = T (= _ =) (3.30) 


Now we’ll consider how this affects the second dipole m = mg. From (3.28), we have 


pe log (= 2) (p+) = m; « =) 


An r3 


where r is the vector from m; to mg. Note that the structure of the force is identical 
to that between two electric dipoles in (2.30). This is particularly pleasing because 
we used two rather different methods to calculate these forces. If we act with the 
derivative, we have 


Arr4 


[(m; - ĉ)ms + (mg - ĉ)m; + (mı -my)F —5(m,-F)(m2-f)F] (3.31) 
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First note that if we swap m; and mg, so that we also send r + —r, then the force 
swaps sign. This is a manifestation of Newton’s third law: every action has an equal 
and opposite reaction. Recall from Dynamics and Relativity lectures that we needed 
Newton’s third law to prove the conservation of momentum of a collection of particles. 
We see that this holds for a bunch of dipoles in a magnetic field. 


But there was also a second part to Newton’s third law: to prove the conservation 
of angular momentum of a collection of particles, we needed the force to lie parallel to 
the separation of the two particles. And this is not true for the force (3.31). If you set 
up a collection of dipoles, they will start spinning, seemingly in contradiction of the 
conservation of angular momentum. What’s going on?! Well, angular momentum is 
conserved, but you have to look elsewhere to see it. The angular momentum carried 
by the dipoles is compensated by the angular momentum carried by the magnetic field 
itself. 


Finally, a few basic comments: the dipole force drops off as 1/r4, quicker than the 
Coulomb force. Correspondingly, it grows quicker than the Coulomb force at short 
distances. If mı and m, point in the same direction and lie parallel to the separation 
R, then the force is attractive. If mı and m, point in opposite directions and lie 
parallel to the separation between them, then the force is repulsive. The expression 
(3.31) tells us the general result. 


3.4.3 So What is a Magnet? 


Until now, we’ve been talking about the magnetic field 
associated to electric currents. But when asked to en- 
visage a magnet, most people would think if a piece of 
metal, possibly stuck to their fridge, possibly in the form 
of a bar magnet like the one shown in the picture. How 
are these related to our discussion above? 


These metals are permanent magnets. They often in- 


Figure 33: 


volve iron. They can be thought of as containing many 
microscopic magnetic dipoles, which align to form a large 
magnetic dipole M. In a bar magnet, the dipole M points between the two poles. The 
iron filings in the picture trace out the magnetic field which takes the same form that 
we saw for the current loop in Section 3.3. 


This means that the leading force between two magnets is described by our result 
(3.31). Suppose that Mı, Mp and the separation R all lie along a line. If M; and M3 
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point in the same direction, then the North pole of one magnet faces the South pole 
of another and (3.31) tells us that the force is attractive. Alternatively, if Mı and M2 
point in opposite directions then two poles of the same type face each other and the 
force is repulsive. This, of course, is what we all learned as kids. 


The only remaining question is: where do the microscopic dipole moments m come 
from? You might think that these are due to tiny electric atomic currents but this 
isn’t quite right. Instead, they have a more fundamental origin. The electric charges 
— which are electrons — possess an inherent angular momentum called spin. Roughly 
you can think of the electron as spinning around its own axis in much the same way as 
the Earth spins. But, ultimately, spin is a quantum mechanical phenomenon and this 
classical analogy breaks down when pushed too far. The magnitude of the spin is: 


1 
s=-ħ 
2 
where, recall, A has the same dimensions as angular momentum. 


We can push the classical analogy of spin just a little further. Classically, an electri- 
cally charged spinning ball would give rise to a magnetic dipole moment. So one may 
wonder if the spinning electron also gives rise to a magnetic dipole. The answer is yes. 
It is given by 

.,8 
m= 95s 
where e is the charge of the electron and m is its mass. The number g is dimensionless 
and called, rather uninspiringly, the g-factor. It has been one of the most important 
numbers in the history of theoretical physics, with several Nobel prizes awarded to 
people for correctly calculating it! The classical picture of a spinning electron suggests 
g = 1. But this is wrong. The first correct prediction (and, correspondingly, first Nobel 
prize) was by Dirac. His famous relativistic equation for the electron gives 


g=2 


Subsequently it was observed that Dirac’s prediction is not quite right. The value of g 
receives corrections. The best current experimental value is 


g = 2.00231930419922 + (1.5 x 107") 


Rather astonishingly, this same value can be computed theoretically using the frame- 
work of quantum field theory (specifically, quantum electrodynamics). In terms of 
precision, this is one of the great triumphs of theoretical physics. 
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There is much much more to the story of magnetism, not least what causes the 
magnetic dipoles m to align themselves in a material. The details involve quantum 
mechanics and are beyond the scope of this course. 


3.5 Units of Electromagnetism 


More than any other subject, electromagnetism is awash with different units. In large 
part this is because electromagnetism has such diverse applications and everyone from 
astronomers, to electrical engineers, to particle physicists needs to use it. But it’s still 
annoying. Here we explain the basics of SI units. 


The SI unit of charge is the Coulomb. As of 20197, the Coulomb is defined in terms 
of the charge —e carried by the electron. This is taken to be exactly 


e = 1.602176634 x 10°" C 


If you rub a balloon on your sweater, it picks up a charge of around 1076 C or so. 
A bolt of lightening deposits a charge of about 15 C. The total charge that passes 
through an AA battery in its lifetime is about 5000 C. 


The SI unit of current is the Ampere, denoted A. It is defined as one Coulomb of 
charge passing every second. The current that runs through single ion channels in 
cell membranes is about 1071? A. The current that powers your toaster is around 
1 Ato 10 A. There is a current in the Earth’s atmosphere, known as the Birkeland 
current, which creates the aurora and varies between 10° A and 10° A. Galactic size 
currents in so-called Seyfert galaxies (particularly active galaxies) have been measured 
at a whopping 10!° A. 


The electric field is measured in units of NC~!. The electrostatic potential œ has 
units of Volts, denoted V, where the 1 Volt is the potential difference between two 
infinite, parallel plates, separated by 1 m, which create an electric field of 1 NC. 


Prior to 2019, a reluctance to rely on fundamental physics meant that the definitions were a little 
more tortuous. The Ampere was taken to be the base unit, and the Coulomb was defined as the 
amount of charge transported by a current of 1 A in a second. The Ampere, in turn, was defined to 
be the current carried by two straight, parallel wires when separated by a distance of 1 m, in order 
to experience an attractive force-per-unit-length of 2 x 107’ Nmt. (Recall that a Newton is the 
unit of force needed to accelerate 1 Kg at 1 ms~+.) From our result (3.25), we see that if we plug in 
I, = Ip =1 Aand d= 1 m then this force is f = 9/27 A?m~!. This definition is the reason that uo 
has the strange-looking value po = 4r x 107-7 m KgC~?. The new definitions of SI units means that 
we can no longer say with certainty that po = 4r x 1077 m KgC™?, but this only holds up to the 
experimental accuracy of a dozen significant figures or so. For our purposes, the main lesson to draw 
from this is that, from the perspective of fundamental physics, SI units are arbitrary and a little daft. 
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A nerve cell sits at around 107? V. An AA battery sits at 1.5 V. The largest man- 
made voltage is 10’ V produced in a van der Graaf generator. This doesn’t compete 
well with what Nature is capable of. The potential difference between the ends of a 
lightening bolt can be 108 V. The voltage around a pulsar (a spinning neutron star) 
can be 101° V. 


The unit of a magnetic field is the Tesla, denoted T. A particle of charge 1 C, passing 


1 will experience a force of 1 N. From the 


through a magnetic field of 1 T at 1 ms- 
examples that we’ve seen above it’s clear that 1 C is a lot of charge. Correspondingly, 
1 T is a big magnetic field. Our best instruments (SQUIDs) can detect changes in 
magnetic fields of 10718 T. The magnetic field in your brain is 1071? T. The strength 
of the Earth’s magnetic field is around 10~° T while a magnet stuck to your fridge has 
about 107° T. The strongest magnetic field we can create on Earth is around 100 T. 
Again, Nature beats us quite considerably. The magnetic field around neutron stars can 
be between 10° T and 10° T. (There is an exception here: in “heavy ion collisions”, 
in which gold or lead nuclei are smashed together in particle colliders, it is thought 
that magnetic fields comparable to those of neutron stars are created. However, these 
magnetic fields are fleeting and small. They are stretch over the size of a nucleus and 
last for a millionth of a second or so). 


As the above discussion amply demonstrates, SI units are based entirely on historical 
convention rather than any deep underlying physics. A much better choice is to pick 
units of charge such that we can discard €9 and uo. There are two commonly used 
frameworks that do this, called Lorentz-Heaviside units and Gaussian units. I should 
warn you that the Maxwell equations take a slightly different form in each. 


To fully embrace natural units, we should also set the speed of light c = 1. (See 
the rant in the Dynamics and Relativity lectures). However we can’t set everything 
to one. There is one combination of the fundamental constants of Nature which is 
dimensionless. It is known as the fine structure constant, 


e? 


a = ——— 
4Teoħc 
and takes value a ~ 1/137. Ultimately, this is the correct measure of the strength of 
the electromagnetic force. It tells us that, in units with ¢7 = A = c = 1, the natural, 
dimensionless value of the charge of the electron is e ~ 0.3. 


3.5.1 A History of Magnetostatics 


The history of magnetostatics, like electrostatics, starts with the Greeks. The fact 
that magnetic iron ore, sometimes known as “lodestone”, can attract pieces of iron was 
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apparently known to Thales. He thought that he had found the soul in the stone. The 
word “magnetism” comes from the Greek town Magnesia, which is situated in an area 
rich in lodestone. 


It took over 1500 years to turn Thales’ observation into something useful. In the 11‘ 
century, the Chinese scientist Shen Kuo realised that magnetic needles could be used 
to build a compass, greatly improving navigation. 


The modern story of magnetism begins, as with electrostatics, with William Gilbert. 
From the time of Thales, it had been thought that electric and magnetic phenomenon 
are related. One of Gilbert’s important discoveries was, ironically, to show that this is 
not the case: the electrostatic forces and magnetostatic forces are different. 


Yet over the next two centuries, suspicions remained. Several people suggested that 
electric and magnetic phenomena are intertwined, although no credible arguments were 
given. The two just smelled alike. The following unisightful quote from Henry Elles, 
written in 1757 to the Royal Society, pretty much sums up the situation: “There are 
some things in the power of magnetism very similar to those of electricity. But I do 
not by any means think them the same”. A number of specific relationships between 
electricity and magnetism were suggested and all subsequently refuted by experiment. 


When the breakthrough finally came, it took everyone by surprise. In 1820, the Dan- 
ish scientist Hans Christian Ørsted noticed that the needle on a magnet was deflected 
when a current was turned on or off. After that, progress was rapid. Within months, 
Ørsted was able to show that a steady current produces the circular magnetic field 
around a wire that we have seen in these lectures. In September that year, Wrsted’s 
experiments were reproduced in front of the French Academy by Francois Arago, a talk 
which seemed to mobilise the country’s entire scientific community. First out of the 
blocks were Jean-Baptiste Biot and Félix Savart who quickly determined the strength 
of the magnetic field around a long wire and the mathematical law which bears their 
name. 


Of those inspired by Arago’s talk, the most important was André-Marie Ampère. 
Skilled in both experimental and theoretical physics, Ampère determined the forces 
that arise between current carrying wires and derived the mathematical law which 
now bears his name: $ B -dr = pol. He was also the first to postulate that there 
exists an atom of electricity, what we would now call the electron. Ampére’s work was 
published in 1827 a book with the catchy title “Memoir on the Mathematical Theory 
of Electrodynamic Phenomena, Uniquely Deduced from Experience”. It is now viewed 
as the beginning of the subject of electrodynamics. 
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4. Electrodynamics 


For static situations, Maxwell’s equations split into the equations of electrostatics, (2.1) 
and (2.2), and the equations of magnetostatics, (3.1) and (3.2). The only hint that there 
is a relationship between electric and magnetic fields comes from the fact that they are 
both sourced by charge: electric fields by stationary charge; magnetic fields by moving 
charge. In this section we will see that the connection becomes more direct when things 
change with time. 


4.1 Faraday’s Law of Induction 


“I was at first almost frightened when I saw such mathematical force made 
to bear upon the subject, and then wondered to see that the subject stood 
it so well.” 


Faraday to Maxwell, 1857 


One of the Maxwell equations relates time varying magnetic fields to electric fields, 


OB 


This equation tells us that if you change a magnetic field, you'll create an electric field. 
In turn, this electric field can be used to accelerate charges which, in this context, is 
usually thought of as creating a current in wire. The process of creating a current 
through changing magnetic fields is called induction. 


We’ll consider a wire to be a conductor, stretched along 
a stationary, closed curve, C, as shown in the figure. We will 
refer to closed wires of this type as a “circuit”. We integrate 
both sides of (4.1) over a surface S which is bounded by C, 


3B 
[5D as=- a 


By Stokes theorem, we can write this as 


OB d 
E-dr=-— GS-7 [Beas 
f g Ot dt Jg 


Recall that the line integral around C should be in the right-handed sense; if the fingers 


Figure 34: 


on your right-hand curl around C then your thumb points in the direction of dS. (This 
means that in the figure dS points in the same direction as B). To get the last equality 
above, we need to use the fact that neither C nor S change with time. Both sides 


6 /.= 


of this equation are usually given names. The integral of the electric field around the 
curve C is called the electromotive force, €, or emf for short, 


E= [Beir 
c 


It’s not a great name because the electromotive force is not really a force. Instead 
it’s the tangential component of the force per unit charge, integrated along the wire. 
Another way to think about it is as the work done on a unit charge moving around the 
curve C’. If there is a non-zero emf present then the charges will be accelerated around 
the wire, giving rise to a current. 


The integral of the magnetic field over the surface S is called the magnetic flux & 


through S, 
ğ = f B-dS 
S 


The Maxwell equation (4.1) can be written as 
E= -— (4.2) 


In this form, the equation is usually called Faraday’s Law. Sometimes it is called the 
flux rule. 


Faraday’s law tells us that if you change the magnetic flux through S then a current 
will flow. There are a number of ways to change the magnetic field. You could simply 
move a bar magnet in the presence of circuit, passing it through the surface S; or you 
could replace the bar magnet with some other current density, restricted to a second 
wire C”, and move that; or you could keep the second wire C” fixed and vary the current 
in it, perhaps turning it on and off. All of these will induce a current in C. 


However, there is then a secondary effect. When a current flows in C, it will create 
its own magnetic field. We’ve seen how this works for steady currents in Section 3. This 
induced magnetic field will always be in the direction that opposes the change. This 
is called Lenz’s law. If you like, “Lenz’s law” is really just the minus sign in Faraday’s 
law (4.2). 
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We can illustrate this with a simple example. Con- 
sider the case where C is a circle, lying in a plane. We’ll 
place it in a uniform B field and then make B smaller < re 
over time, so ® < 0. By Faraday’s law, € > 0 and the Q | | Q) 
current will flow in the right-handed direction around | = | 
C as shown. But now you can wrap your right-hand 
in a different way: point your thumb in the direction Figure 35: Lenz’s law 
of the current and let your fingers curl to show you the 
direction of the induced magnetic field. These are the circles drawn in the figure. You 


see that the induced current causes B to increase inside the loop, counteracting the 
original decrease. 


Lenz’s law is rather like a law of inertia for magnetic fields. It is necessary that 
it works this way simply to ensure energy conservation: if the induced magnetic field 
aided the process, we’d get an unstable runaway situation in which both currents and 
magnetic fields were increasing forever. 


4.1.1 Faraday’s Law for Moving Wires 


There is another, related way to induce cur- 
rents in the presence of a magnetic field: you can B ie 
keep the field fixed, but move the wire. Perhaps a Z 
the simplest example is shown in the figure: it’s ER 
a rectangular circuit, but where one of the wires 
is a metal bar that can slide backwards and for- | | | 


wards. This whole set-up is then placed in a 


magnetic field, which passes up, perpendicular 


2 8 Figure 36: Moving circuit 
through the circuit. 


Slide the bar to the left with speed v. Each charge q in the bar experiences a Lorentz 
force quB, pushing it in the y direction. This results in an emf which, now, is defined 
as the integrated force per charge. In this case, the resulting emf is 


E=vBd 


where d is the length of the moving bar. But, because the area inside the circuit 
is getting smaller, the flux through C is also decreasing. In this case, it’s simple to 
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compute the change of flux: it is 


dö 
8d 
a = 


We see that once again the change of flux is related to the emf through the flux rule 


Note that this is the same formula (4.2) that we derived previously, but the physics 
behind it looks somewhat different. In particular, we used the Lorentz force law and 
didn’t need the Maxwell equations. 


As in our previous example, the emf will drive a current around the loop C. And, 
just as in the previous example, this current will oppose the motion of the bar. In this 
case, it is because the current involves charges moving with some speed u around the 
circuit. These too feel a Lorentz force law, now pushing the bar back to the right. This 
means that if you let the bar go, it will not continue with constant speed, even if the 
connection is frictionless. Instead it will slow down. This is the analog of Lenz’s law in 
the present case. We’ll return to this example in Section 4.1.3 and compute the bar’s 
subsequent motion. 


C(t+8t) 


The General Case 


There is a nice way to include both the effects of time- 
dependent magnetic fields and the possibility that the 
circuit C changes with time. We consider the moving 
loop C(t), as shown in the figure. Now the change in 


C(t) 


flux through a surface S has two terms: one because B Figure 37: Moving Circuits 
may be changing, and one because C is changing. In a 
small time dt, we have 


5 = D(t + öt) — H(t) = i B(t + ôt) -dS — f B(t) - dS 


(t) 
B 
=) 5 ot a+ | f -f | B0 -a8 + 00) 
s(t) Ot s(t+ét) J S(t) 


We can do something with the middle terms. Consider the closed surface created by 
S(t) and S(t + ôt), together with the cylindrical region swept out by C(t) which we 
call Se. Because V -B = 0, the integral of B(t) over any closed surface vanishes. But 
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Sscexsy — Ís is the top and bottom part of the closed surface, with the minus sign 
just ensuring that the integral over the bottom part S(t) is in the outward direction. 
This means that we must have 


Tocco p [| Bi) - dS = — f, B(t)-ds 


For the integral over Se, we can write the surface element as 
dS = (dr x v)ot 


where dr is the line element along C(t) and v is the velocity of a point on C. We find 
that the expression for the change in flux can be written as 


ate E Zas- | (v x B)-dr 
dt 6t30 Ot S(t) Ot C(t) 


where we’ve taken the liberty of rewriting (dr x v)-B = dr - (v x B). Now we use the 
Maxwell equation (4.1) to rewrite the OB/0t in terms of the electric field. This gives 
us our final expression 


where the right-hand side now includes the force tangential to the wire from both 
electric fields and also from the motion of the wire in the presence of magnetic fields. 
The electromotive force should be defined to include both of these contributions, 


E= f (B+vxB)-dr 
€ 
and we once again get the flux rule € = —d®/dt. 


4.1.2 Inductance and Magnetostatic Energy 


In Section 2.3, we computed the energy stored in the electric field by considering the 
work done in building up a collection of charges. But we didn’t repeat this calculation 
for the magnetic field in Section 3. The reason is that we need the concept of emf to 
describe the work done in building up a collection of currents. 


Suppose that a constant current J flows along some curve C. From the results of 
Section 3 we know that this gives rise to a magnetic field and hence a flux ® = f 5B-dS 
through the surface S bounded by C. Now increase the current J. This will increase 
the flux ®. But we’ve just learned that the increase in flux will, in turn, induce an emf 
around the curve C. The minus sign of Lenz’s law ensures that this acts to resist the 
change of current. The work needed to build up a current is what’s needed to overcome 
this emf. 


=F = 


Inductance 


If a current J flowing around a curve C gives rise to a flux ®@ = f gB -dS then the 
inductance L of the circuit is defined to be 


p 
I 


The inductance is a property only of our choice of curve C. 


An Example: The Solenoid 


A solenoid consists of a cylinder of length l and cross-sectional area A. B 
We take | > VA so that any end-effects can be neglected. A wire 
wrapped around the cylinder carries current J and winds N times per 

unit length. We previously computed the magnetic field through the 

centre of the solenoid to be (3.7) 


B=oIN 


This means that a flux through a single turn is ®ọ = oI NA. The 


solenoid consists of N/ turns of wire, so the total flux is Figure 38: 


® = mI N? Al = po I N?V 
with V = Al the volume inside the solenoid. The inductance of the solenoid is therefore 
L = mN’ V 


Magnetostatic Energy 


The definition of inductance is useful to derive the energy stored in the magnetic field. 
Let’s take our circuit C with current I. We’ll try to increase the current. The induced 
emf is 


od dt 


As we mentioned above, the induced emf can be thought of as the work done in moving 
a unit charge around the circuit. But we have current J flowing which means that, in 
time ôt, a charge /6t moves around the circuit and the amount of work done is 


dI d I Lar 
—jt => Ue u. = a 


OW. Seth EE dt dt 2 dt 
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The work needed to build up the current is just the opposite of this. Integrating over 
time, we learn that the total work necessary to build up a current J along a curve with 
inductance L is 


1 1 
W = -LI = -I9 
2 2 


Following our discussion for electric energy in (2.3), we identify this with the energy U 
stored in the system. We can write it as 


1 1 
U=31 f B-a8=31 f vx ads=51$ Acde=5 | čz3-A 
a de J. an 2 


where, in the last step, we’ve used the fact that the current density J is localised on 
the curve C to turn the integral into one over all of space. At this point we turn to the 
Maxwell equation V x B = uoJ to write the energy as 


1 1 

pa de (Y xB): A= z= [d's IV: (B x A) +B- (V x A) 
24o 2/0 

We assume that B and A fall off fast enough at infinity so that the first term vanishes. 

We’re left with the simple expression 


1 
U=-— | dc¢B-B 
2/0 
Combining this with our previous result (2.27) for the electric field, we have the energy 
stored in the electric and magnetic fields, 


g= | ds (SE-B+>-B-B) (4.3) 
u 


This is a nice result. But there’s something a little unsatisfactory behind our derivation 
of (4.3). First, we reiterate a complaint from Section 2.3: we had to approach the energy 
in both the electric and magnetic fields in a rather indirect manner, by focussing not 
on the fields but on the work done to assemble the necessary charges and currents. 
There’s nothing wrong with this, but it’s not a very elegant approach and it would be 
nice to understand the energy directly from the fields themselves. One can do better 
by using the Lagrangian approach to Maxwell’s equations which we turn to in Section 
5.6 


Second, we computed the energy for the electric fields and magnetic fields alone and 
then simply added them. We can’t be sure, at this point, that there isn’t some mixed 
contribution to the energy such as E- B. It turns out that there are no such terms. 
Again, we’ll postpone a proof of this until Section 5.6. 
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4.1.3 Resistance 


You may have noticed that our discussion above has been a little qualitative. If the 
flux changes, we have given expressions for the induced emf € but we have not given 
an explicit expression for the resulting current. And there’s a good reason for this: it’s 
complicated. 


The presence of an emf means that there is a force on the charges in the wire. And 
we know from Newtonian mechanics that a force will cause the charges to accelerate. 
This is where things start to get complicated. Accelerating charges will emit waves of 
electromagnetic radiation, a process that you will explore later. Relatedly, there will 
be an opposition to the formation of the current through the process that we’ve called 
Lenz’s law. 


So things are tricky. What’s more, in real wires and materials there is yet another 
complication: friction. Throughout these lectures we have modelled our charges as if 
they are moving unimpeded, whether through the vacuum of space or through a con- 
ductor. But that’s not the case when electrons move in real materials. Instead, there’s 
stuff that gets in their way: various messy impurities in the material, or sound waves 
(usually called phonons in this context) which knock them off-course, or even other 
electrons. All these effects contribute to a friction force that acts on the moving elec- 
trons. The upshot of this is that the electrons do not accelerate forever. In fact, they 
do not accelerate for very long at all. Instead, they very quickly reach an equilibrium 
speed, analogous to the “terminal velocity” that particles reach when falling in grav- 
itational field while experiencing air resistance. In many circumstances, the resulting 
current I is proportional to the applied emf. This relationship is called Ohm’s law. It 
is 


E=IR (4.4) 


The constant of proportionality R is called the resistance. The emf is E = f E- dx. If 
we write E = —Vẹọ, then € = V, the potential difference between two ends of the wire. 
This gives us the version of Ohm’s law that is familiar from school: V = IR. 


The resistance R depends on the size and shape of the wire. If the wire has length 
L and cross-sectional area A, we define the resistivity as p = AR/L. (It’s the same 
Greek letter that we earlier used to denote charge density. They’re not the same thing. 
Sorry for any confusion!) The resistivity has the advantage that it’s a property of 
the material only, not its dimensions. Alternatively, we talk about the conductivity 
o = 1/p. (This is the same Greek letter that we previously used to denote surface 
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charge density. They’re not the same thing either.) The general form of Ohm’s law is 
then 


J =oE 


Unlike the Maxwell equations, Ohm’s law does not represent a fundamental law of 
Nature. It is true in many, perhaps most, materials. But not all. There is a very 
simple classical model, known as the Drude model, which treats electrons as billiard 
balls experiencing linear drag which gives rise to Ohm’s law.. But a proper derivation of 
Ohm’s law needs quantum mechanics and a more microscopic understanding of what’s 
happening in materials. Needless to say, this is (way) beyond the scope of this course. 
So, at least in this small section, we will take Ohm’s law (4.4) as an extra input in our 
theory. 


When Ohm’s law holds, the physics is very different. Now the applied force (or, 
in this case, the emf) is proportional to the velocity of the particles rather than the 
acceleration. It’s like living in the world that Aristotle envisaged rather than the one 
Galileo understood. But it also means that the resulting calculations typically become 
much simpler. 


An Example 


Let’s return to our previous example of a slid- 

ing bar of length d and mass m which forms a B au. 

circuit, sitting in a magnetic field B = Bz. But T Z 
now we will take into account the effect of elec- ZA 
trical resistance. We take the resistance of the x 
sliding bar to be R. But we’ll make life easy for 
ourselves and assume that the resistance of the | | | 


rest of the circuit is negligible. Figure 39: 


There are two dynamical degrees of freedom 
in our problem: the position x of the sliding bar and the current J that flows around the 
circuit. We take J > 0 if the current flows along the bar in the positive y direction. The 
Lorentz force law tells us that the force on a small volume of the bar is F = IB ŷ x z. 
The force on the whole bar is therefore 


F = I Bdà 
The equation of motion for the position of the wire is then 


mł = I Bd 


— p= 


Now we need an equation that governs the current I(t). If the total emf around the 
circuit comes from the induced emf, we have 


Ohm/’s law tells us that € = IR. Combining these, we get a simple differential equation 
for the position of the bar 


Be 
— 7 


R 


which we can solve to see that any initial velocity of the bar, v, decays exponentially: 


mr = 


a(t) = _ye BH t/mR 


Note that, in this calculation we neglected the magnetic field created by the current. 
It’s simple to see the qualitative effect of this. If the bar moves to the left, so & < 0, then 
the flux through the circuit decreases. The induced current is J > 0 which increases B 
inside the circuit which, in accord with Lenz’s law, attempts to counteract the reduced 
flux. 


In the above derivation, we assumed that the total emf around the circuit was pro- 
vided by the induced emf. This is tantamount to saying that no current flows when the 
bar is stationary. But we can also relax this assumption and include in our analysis an 
emf E across the circuit (provided, for example, by a battery) which induces a current 
Ip = Eod / R. Now the total emf is 


c= Eo + Einduced = Eo — Bdi 


The total current is again given by Ohms law J = €/R. The position of the bar is now 
governed by the equation 


Again, it’s simple to solve this equation. 


Joule Heating 


In Section 4.1.2, we computed the work done in changing the current in a circuit C. 
This ignored the effect of resistance. In fact, if we include the resistance of a wire then 
we need to do work just to keep a constant current. This should be unsurprising. It’s 
the same statement that, in the presence of friction, we need to do work to keep an 
object moving at a constant speed. 
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Let’s return to a fixed circuit C. As we mentioned above, if a battery provides an 
emf Ep, the resulting current is J = €)/R. We can now run through arguments similar 
to those that we saw when computing the magnetostatic energy. The work done in 
moving a unit charge around C is €) which means that amount of work necessary to 
keep a current J moving for time ot is 


OW = Eplét = I'R ôt 


We learn that the power (work per unit time) dissipated by a current passing through 
a circuit of resistance R is dW/dt = I?R. This is not energy that can be usefully stored 
like the magnetic and electric energy (4.3); instead it is lost to friction which is what 
we call heat. (The difference between heat and other forms of energy is explained in 
the Thermodynamics section in the Statistical Physics notes). The production of heat 
by a current is called Joule heating or, sometimes, Ohmic heating. 


4.1.4 Michael Faraday (1791-1867) 


“The word “physicist” is both to my mouth and ears so awkward that 
I think I shall never be able to use it. The equivalent of three separate 
sounds of “s” in one word is too much.” 


Faraday in a letter to William Whewell 3 


Michael Faraday’s route into science was far from the standard one. The son 
of a blacksmith, he had little schooling and, at the age of 14, was apprenticed to a 
bookbinder. There he remained until the age of 20 when Faraday attended a series of 
popular lectures at the Royal Institution by the chemist Sir Humphry Davy. Inspired, 
Faraday wrote up these lectures, lovingly bound them and presented them to Davy as 
a gift. Davy was impressed and some months later, after suffering an eye injury in an 
explosion, turned to Faraday to act as his assistant. 


Not long after, Davy decided to retire and take a two-year leisurely tour of Europe, 
meeting many of the continent’s top scientists along the way. He asked Faraday to join 
him and his wife, half as assistant, half as valet. The science part of this was a success; 
the valet part less so. But Faraday dutifully played his roles, emptying his master’s 
chamber pot each morning, while aiding in a number of important scientific discoveries 
along the way, including a wonderful caper in Florence where Davy and Faraday used 
Galileo’s old lens to burn a diamond, reducing it, for the first time, to Carbon. 


3 According to the rest of the internet, Faraday complains about three separate sounds of “i”. The 
rest of the internet is wrong and can’t read Faraday’s writing. The original letter is in the Wren library 
in Trinity College and is shown on the next page. I’m grateful to Frank James, editor of Faraday’s 
correspondence, for help with this. 
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Back in England, Faraday started work at the 
Royal Institution. He would remain there for over 
45 years. An early attempt to study electricity and 


magnetism was abandoned after a priority dispute tem 
with his former mentor Davy and it was only after ee da aa 
Davy’s death in 1829 that Faraday turned his at- a a 
tentions fully to the subject. He made his discovery peor erg worl LM. : 
of induction on 28 October, 1831. The initial ex- Vpro k vm Ble A s 
periment involved two, separated coils of wire, both a Ze tt ti 
wrapped around the same magnet. Turning on a ^ > mo s I 
current in one wire induces a momentary current in | ⁄ 4 pae niia A 
the second. Soon after, he found that a current is -ga i E E E 
also induced by passing a loop of wire over a mag- f MC: fo 
net. The discovery of induction underlies the elec- pferdc sobs fF wh eee 


trical dynamo and motor, which convert mechanical a Garonne y pe 
. . A A EA fona h hhn O 

energy into electrical energy and vice-versa. 4 

Faraday was not a great theorist and the mathe- Figure 40: 

matical expression that we have called Faraday’s law 

is due to Maxwell. Yet Faraday’s intuition led him to make one of the most important 

contributions of all time to theoretical physics: he was the first to propose the idea of 

the field. 


As Faraday’s research into electromagnetism increased, he found himself lacking the 
vocabulary needed to describe the phenomena he was seeing. Since he didn’t exactly 
receive a classical education, he turned to William Whewell, then Master of Trinity, 


for some advice. Between them, they cooked up the words ‘anode’, ‘cathode’, ‘ion’, 
‘dielectric’, ‘diamagnetic’ and ‘paramagnetic’. They also suggested the electric charge 


be renamed ‘Franklinic’ in honour of Benjamin Franklin. That one didn’t stick. 


The last years of Faraday’s life were spent in the same way as Einstein: seeking a 
unified theory of gravity and electromagnetism. The following quote describes what is, 
perhaps, the first genuine attempt at unification: 


Gravity: Surely his force must be capable of an experimental relation to 
Electricity, Magnetism and the other forces, so as to bind it up with them 
in reciprocal action and equivalent effect. Consider for a moment how to 
set about touching this matter by facts and trial ... 


Faraday, 19°” March, 1849. 
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As this quote makes clear, Faraday’s approach to this problem includes something 
that Einstein’s did not: experiment. Ultimately, neither of them found a connection 
between electromagnetism and gravity. But it could be argued that Faraday made the 
more important contribution: while a null theory is useless, a null experiment tells you 
something about Nature. 


4.2 One Last Thing: The Displacement Current 
We’ve now worked our way through most of the Maxwell equations. We’ve looked at 
Gauss’ law (which is really equivalent to Coulomb’s law) 


Vibe (4.5) 
€0 


and the law that says there are no magnetic monopoles 


V-B=0 (4.6) 
and Ampère’s law 
V x B= mJ (4.7) 
and now also Faraday’s law 
OB 
EL = 4.8 
VEB =8 (4.8) 


In fact, there’s only one term left to discuss. When fields change with time, there is an 
extra term that appears in Ampére’s law, which reads in full: 


E 
Vx B= Ho (3 + oT) (4.9) 


This extra term is called the displacement current. It’s not a great name because it’s 
not a current. Nonetheless, as you can see, it sits in the equation in the same place as 
the current which is where the name comes from. 


So what does this extra term do? Well, something quite remarkable. But before we 
get to this, there’s a story to tell you. 


The first four equations above (4.5), (4.6), (4.7) and (4.8) — which include Ampére’s 
law in unmodified form — were arrived at through many decades of painstaking ex- 
perimental work to try to understand the phenomena of electricity and magnetism. Of 
course, it took theoretical physicists and mathematicians to express these laws in the 
elegant language of vector calculus. But all the hard work to uncover the laws came 
from experiment. 
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The displacement current term is different. This was arrived at by pure thought 
alone. This is one of Maxwell’s contributions to the subject and, in part, why his 
name now lords over all four equations. He realised that the laws of electromagnetism 
captured by (4.5) to (4.8) are not internally consistent: the displacement current term 
has to be there. Moreover, once you add it, there are astonishing consequences. 


4.2.1 Why Ampere’s Law is Not Enough 


We’ll look at the consequences in the next section. But for now, let’s just see why the 
unmodified Ampére law (4.7) is inconsistent. We simply need to take the divergence 
to find 


woV-J=V-(VxB)=0 


This means that any current that flows into a given volume has to also flow out. But 
we know that’s not always the case. To give a simple example, we can imagine putting 
lots of charge in a small region and watching it disperse. Since the charge is leaving the 
central region, the current does not obey V -J = 0, seemingly in violation of Ampére’s 
law. 


There is a standard thought experiment involving cir- 


cuits which is usually invoked to demonstrate the need to 
amend Ampére’s law. This is shown in the figure. The idea | | 
is to cook up a situation where currents are changing over 
time. To do this, we hook it up to a capacitor — which can | 
be thought of as two conducting plates with a gap between R 


them — to a circuit of resistance R. The circuit includesa 1—5, N 
switch. When the switch is closed, the current will flow out 
of the capacitor and through the circuit, ultimately heating Figure 41: 


up the resistor. 


So what’s the problem here? Let’s try to compute the magnetic field created by the 
current at some point along the circuit using Ampére’s law. We can take a curve C that 
surrounds the wire and surface S with boundary C. If we chose S to be the obvious 
choice, cutting through the wire, then the calculation is the same as we saw in Section 
3.1. We have 


f B- dr = pol (4.10) 
C 


where J is the current through the wire which, in this case, is changing with time. 
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Figure 42: This choice of surface sug- Figure 43: This choice of surface sug- 
gests there is a magnetic field gests there is none. 


Suppose, however, that we instead decided to bound the curve C with the surface 
S’, which now sneaks through the gap between the capacitor plates. Now there is no 
current passing through 5S’, so if we were to use Ampére’s law, we would conclude that 
there is no magnetic field 


[B-ar=o (4.11) 
C 


This is in contradiction to our first calculation (4.10). So what’s going on here? Well, 
Ampère’s law only holds for steady currents that are not changing with time. And we’ve 
deliberately put together a situation where J is time dependent to see the limitations 
of the law. 


Adding the Displacement Current 


Let’s now see how adding the displacement current (4.9) fixes the situation. We’ll 
first look at the abstract issue that Ampère’s law requires V - J = 0. If we add the 
displacement current, then taking the divergence of (4.9) gives 


E 
m(v Itev E) =V.(V x B)=0 


But, using Gauss’s law, we can write €o V - E = p, so the equation above becomes 


which is the continuity equation that tells us that electric charge is locally conserved. 
It’s only with the addition of the displacement current that Maxwell’s equations become 
consistent with the conservation of charge. 
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Now let’s return to our puzzle of the circuit and capacitor. Without the displacement 
current we found that B = 0 when we chose the surface S” which passes between 
the capacitor plates. But the displacement current tells us that we missed something, 
because the build up of charge on the capacitor plates leads to a time-dependent electric 
field between the plates. For static situations, we computed this in (2.10): it is 


EA 
co A 


where A is the area of each plate and Q is the charge that sits on each plate, and we are 
ignoring the edge effects which is acceptable as long as the size of the plates is much 
bigger than the gap between them. Since Q is increasing over time, the electric field is 
also increasing 


OF 1d 1g 
ot gi dt eas 


So now if we repeat the calculation of B using the surface S’, we find an extra term 


OE 
Bedr= | poeot = pol 
[ ge 0 


This is the same answer (4.10) that we found using Ampére’s law applied to the surface 


S. 


from (4.9) which gives 


Great. So we see why the Maxwell equations need the extra term known as the 
displacement current. Now the important thing is: what do we do with it? As we’ll 
now see, the addition of the displacement current leads to one of the most wonderful 
discoveries in physics: the explanation for light. 


4.3 And There Was Light 


The emergence of light comes from looking for solutions of Maxwell’s equations in which 
the electric and magnetic fields change with time, even in the absence of any external 
charges or currents. This means that we’re dealing with the Maxwell equations in 


vacuum: 
E 
V-E=0 and V x B= poco 

OB 

.B= Ese 

V 0 and Vx Ji 


The essence of the physics lies in the two Maxwell equations on the right: if the electric 
field shakes, it causes the magnetic field to shake which, in turn, causes the electric 


= 82 — 


field to shake, and so on. To derive the equations governing these oscillations, we start 
by computing the second time derivative of the electric field, 


CE oð OB 
Hisp =a (VX B)= Vx =—-V x(V XE) (4.12) 


To complete the derivation, we need the identity 
Vx(VxE)=V(V-E)-VWE 


But, the first of Maxwell equations tells us that V - E = 0 in vacuum, so the first term 
above vanishes. We find that each component of the electric field satisfies, 


+, -VWE= (4.13) 


This is the wave equation. The speed of the waves, c, is given by 


1 
c= 4| — 
Lo€o 
Identical manipulations hold for the magnetic field. We have 


oB ð OE 1 1 
ee EEEE v 2a B) = —_V’B 
Je p (V x E) Vx Ai uo x (V x B) a 


where, in the last equality, we have made use of the vector identity (4.12), now applied 
to the magnetic field B, together with the Maxwell equation V -B = 0. We again find 
that each component of the magnetic field satisfies the wave equation, 
10°B 
eo ot 


The waves of the magnetic field travel at the same speed c as those of the electric field. 


—V’B=0 (4.14) 


What is this speed? At the very beginning of these lectures we provided the numerical 
values of the electric constant 


€o = 8.854187817 x 1071? m~? Kg"! s? C? 
and the magnetic constant, 
lo = 4r x 107’ m Kg O7? 
Plugging in these numbers gives the speed of electric and magnetic waves to be 
c = 299792458 ms™' 


But this is something that we’ve seen before. It’s the speed of light! This, of course, is 
because these electromagnetic waves are light. In the words of the man himself 
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“The velocity of transverse undulations in our hypothetical medium, calcu- 
lated from the electro-magnetic experiments of MM. Kohlrausch and Weber, 
agrees so exactly with the velocity of light calculated from the optical ex- 
periments of M. Fizeau, that we can scarcely avoid the inference that light 
consists in the transverse undulations of the same medium which is the 
cause of electric and magnetic phenomena” 


James Clerk Maxwell 


The simple calculation that we have just seen represents one of the most important 
moments in physics. Not only are electric and magnetic phenomena unified in the 
Maxwell equations, but now optics — one of the oldest fields in science — is seen to be 
captured by these equations as well. 


4.3.1 Solving the Wave Equation 


We've derived two wave equations, one for E and one for B. We can solve these 
independently, but it’s important to keep in our mind that the solutions must also 
obey the original Maxwell equations. This will then give rise to a relationship between 
E and B. Let’s see how this works. 


We’ll start by looking for a special class of solutions in which waves propagate in the 
x-direction and do not depend on y and z. These are called plane-waves because, by 
construction, the fields E and B will be constant in the (y, z) plane for fixed x and t. 


The Maxwell equation V -E = 0 tells us that we must have FE, constant in this case. 
Any constant electric field can always be added as a solution to the Maxwell equations 
so, without loss of generality, we’ll choose this constant to vanish. We look for solutions 
of the form 


E = (0, E(x, t), 0) 


where E satisfies the wave equation (4.13) which is now 


The most general solution to the wave equation takes the form 
E(x,t) = f(x — ct) + glx + ct) 


Here f(x— ct) describes a wave profile which moves to the right with speed c. (Because, 
as t increases, x also has to increase to keep f constant). Meanwhile, g(x + ct) describes 
a wave profile moving to the left with the speed c. 
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The most important class of solutions of this kind are those which oscillate with a 
single frequency w. Such waves are called monochromatic. For now, we’ll focus on 
the right-moving waves and take the profile to be the sine function. (We’ll look at 
the option to take cosine waves or other shifts of phase in a moment when we discuss 
polarisation). We have 


E = Egsin E (= — t) 
c 
We usually write this as 
E = Eosin (kx — wt) (4.15) 


where k is the wavenumber. The wave equation (4.13) requires that it is related to the 
frequency by 


w = ek? 


Equations of this kind, expressing frequency in terms of wavenumber, are called dis- 
persion relations. Because waves are so important in physics, there’s a whole bunch of 
associated quantities which we can define. They are: 


e The quantity w is more properly called the angular frequency and is taken to be 
positive. The actual frequency f = w/2a measures how often a wave peak passes 
you by. But because we will only talk about w, we will be lazy and just refer to 
this as frequency. 


e The period of oscillation is T = 27/w. 


e The wavelength of the wave is A = 27/k. This is the property of waves that 
you first learn about in kindergarten. The wavelength of visible light is between 
A ~ 3.9 x 10-7 m and 7 x 107-7 m. At one end of the spectrum, gamma rays 
have wavelength À ~ 1071? m and X-rays around \ ~ 1071? to 1078 m. At the 
other end, radio waves have A ~ 1 cm to 10 km. Of course, the electromagnetic 
spectrum doesn’t stop at these two ends. Solutions exist for all A. 


Although we grow up thinking about wavelength, moving forward the wavenum- 
ber k will turn out to be a more useful description of the wave. 


e Fois the amplitude of the wave. 
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So far we have only solved for the electric field. To determine the magnetic field, 
we use V- B = 0 to tell us that B, is constant and we again set Bẹ = 0. We know 
that the other components B, and B, must obey the wave equation (4.14). But their 
behaviour is dictated by what the electric field is doing through the Maxwell equation 
V x E = —B/ðt. This tells us that 


B = (0,0, B) 
with 
OB OE 
aa —k Eo cos(kx — wt) 
We find 
E 
B = = sin(kx — wt) (4.16) 


We see that the electric E and magnetic B fields oscillate in phase, but in orthogonal 
directions. And both oscillate in directions which are orthogonal to the direction in 
which the wave travels. 


all 
y~ 


Because the Maxwell equations are linear, we’re allowed to add any number of solu- 
tions of the form (4.15) and (4.16) and we will still have a solution. This sometimes 
goes by the name of the principle of superposition. (We mentioned it earlier when 
discussing electrostatics). This is a particularly important property in the context of 
light, because it’s what allow light rays travelling in different directions to pass through 
each other. In other words, it’s why we can see anything at all. 


The linearity of the Maxwell equations also encourages us to introduce some new 
notation which, at first sight, looks rather strange. We will often write the solutions 
(4.15) and (4.16) in complex notation, 


. E : 
B=hye"" , Ba 2 (4.17) 
C 
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This is strange because the physical electric and magnetic fields should certainly be real 
objects. You should think of them as simply the real parts of the expressions above. 
But the linearity of the Maxwell equations means both real and imaginary parts of 
E and B solve the Maxwell equations. And, more importantly, if we start adding 
complex E and B solutions, then the resulting real and imaginary pieces will also solve 
the Maxwell equations . The advantage of this notation is simply that it’s typically 
easier to manipulate complex numbers than lots of cos and sin formulae. 


However, you should be aware that this notation comes with some danger: whenever 
you compute something which isn’t linear in E and B — for example, the energy stored 
in the fields, which is a quadratic quantity — you can’t use the complex notation above; 
you need to take the real part first. 


4.3.2 Polarisation 


Above we have presented a particular solution to the wave equation. Let’s now look 
at the most general solution with a fixed frequency w. This means that we look for 
solutions within the ansatz, 


E = Ep étt and B=Boe**“) (4.18) 


where, for now, both Eo and Bo could be complex-valued vectors. (Again, we only get 
the physical electric and magnetic fields by taking the real part of these equations). 
The vector k is called the wavevector. Its magnitude, |k| = k, is the wavenumber and 
the direction of k points in the direction of propagation of the wave. The expressions 
(4.18) already satisfy the wave equations (4.13) and (4.14) if w and k obey the dispersion 
relation w? = c?k?. 

We get further constraints on Eo, Bo and k from the original Maxwell equations. 
These are 

V-E=0 =} wk Eg=0 
OB 


Nea > ik x Ep = iwBo 


Let’s now interpret these equations: 


Linear Polarisation 


Suppose that we take Ep and Bo to be real. The first two equations above say that both 
Eo and Bo are orthogonal to the direction of propagation. The last of the equations 
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above says that Ep and Bo are also orthogonal to each other. You can check that the 
fourth Maxwell equation doesn’t lead to any further constraints. Using the dispersion 
relation w = ck, the last constraint above can be written as 


k x (Ep/c) = Bo 


This means that the three vectors k, Eo /c and Bo form a right-handed orthogonal triad. 
Waves of this form are said to be linearly polarised. The electric and magnetic fields 
oscillate in fixed directions, both of which are transverse to the direction of propagation. 


Circular and Elliptic Polarisation 


Suppose that we now take Ep and Bo to be complex. The actual electric and magnetic 
fields are just the real parts of (4.18), but now the polarisation does not point in a fixed 
direction. To see this, write 


Eo =a— iB 
The real part of the electric field is then 
E = acos(k- x — wt) + Gsin(k- x — wt) 


with Maxwell equations ensuring that œ -k = 8-k = 0. If we look at the direction of 
E at some fixed point in space, say the origin x = 0, we see that it doesn’t point in 
a fixed direction. Instead, it rotates over time within the plane spanned by œ and 8 
(which is the plane perpendicular to k). 


A special case arises when the phase of Ep is e’"/4, so that |a| = |6|, with the further 
restriction that a-@=0. Then the direction of E traces out a circle over time in the 
plane perpendicular to k. This is called circular polarisation. The polarisation is said 
to be right-handed if B = k x a and left-handed if B = —k x a. 


In general, the direction of E at some point in space will trace out an ellipse in the 
plane perpendicular to the direction of propagation k. Unsurprisingly, such light is said 
to have elliptic polarisation. 


General Wave 


A general solution to the wave equation consists of combinations of waves of different 
wavenumbers and polarisations. It is naturally expressed as a Fourier decomposition 
by summing over solutions with different wavevectors, 
ak i(k-x—wt 
E(x, t) = f (ns E(k) e” ) 
Here, the frequency of each wave depends on the wavevector by the now-familiar dis- 
persion relation w = ck. 
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4.3.3 An Application: Reflection off a Conductor 


There are lots of things to explore with electromagnetic waves and we will see many 
examples later in the course. For now, we look at a simple application: we will reflect 
waves off a conductor. We all know from experience that conductors, like metals, look 
shiny. Here we'll see why. 


Suppose that the conductor occupies the half of space 


x > 0. We start by shining the light head-on onto the SS 
surface. This means an incident plane wave, travelling in E inc k 
: ; ——— 
the x-direction, = 
Eine = Evy el(kx—wt) or 
E ref = 
where, as before, w = ck. Inside the conductor, we know 


that we must have E = 0. But the component E -ý lies 
tangential to the surface and so, by continuity, must also Pigure ads 
vanish just outside at x = 07. We achieve this by adding a reflected wave, travelling 


in the opposite direction 
Ever = — Ep ve 


So that the combination E = Eine + Eyer satisfies E(x = 0) = 0 as it must. This 
is illustrated in the figure. (Note, however, that the figure is a little bit misleading: 
the two waves are shown displaced but, in reality, both fill all of space and should be 
superposed on top of each other). 


We've already seen above that the corresponding magnetic field can be determined 
by V x E = —0B/0dt. It is given by B = Bine + Bref, with 


E ' E ! 
Bine = 2 2 eee and Bret = = Zz gee (4.19) 
C C 


This obeys B -n = 0, as it should by continuity. But the tangential component doesn’t 

vanish at the surface. Instead, we have 

2Eo 

— e 
c 


B. Êl o- = —iwt 
Since the magnetic field vanishes inside the conductor, we have a discontinuity. But 
there’s no mystery here. We know from our previous discussion (3.6) that this corre- 
sponds to a surface current K induced by the wave 
K = 2Eo y e wt 
CHO 
We see that the surface current oscillates with the frequency of the reflected wave. 
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Reflection at an Angle 


Let’s now try something a little more complicated: we’ll send in 
the original ray at an angle, 6, to the normal as shown in the 


figure. Our incident electric field is oL 
Eine = Eo y ey We 


Figure 45: 


where 
k= kcosĝ $ + ksin ĉ 


Notice that we’ve made a specific choice for the polarisation of the electric field: it is 
out of the page in the figure, tangential to the surface. Now we have two continuity 
conditions to worry about. We want to add a reflected wave, 


Eyer = — Eo ¢ a) 


where we've allowed for the possibility that the polarisation È, the wavevector k’ and 
frequency w’ are all different from the incident wave. We require two continuity condi- 
tions on the electric field 


(Eine + Eyes): =O and (Eine + Eyer) x Â = 0 


where, for this set-up, the normal vector is n = —x. This is achieved by taking w’ = w 
and Ç = y, so that the reflected wave changes neither frequency nor polarisation. The 
reflected wavevector is 


k’ = —k cos $ + ksin 0 ĉ 


We can also check what becomes of the magnetic field. It is B = Bine + Bret, with 


Note that, in contrast to (4.19), there is now a minus sign in the reflected Byer, but 
this is simply to absorb a second minus sign coming from the appearance of k’ in the 
polarisation vector. It is simple to check that the normal component B-n vanishes at 
the interface, as it must. Meanwhile, the tangential component again gives rise to a 
surface current. 
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The main upshot of all of this discussion is relationship between k and k’ which tells 
us something that we knew when we were five: the angle of incidence is equal to the 
angle of reflection. Only now we’ve derived this from the Maxwell equations. If this is 
a little underwhelming, we’ll derive many more properties of waves later. 


4.3.4 James Clerk Maxwell (1831-1879) 


Still those papers lay before me, 
Problems made express to bore me, 
When a silent change came o’er me, 
In my hard uneasy chair. 

Fire and fog, and candle faded, 
Spectral forms the room invaded, 
Little creatures, that paraded 

On the problems lying there. 


James Clerk Maxwell, “A Vision of a Wrangler, of a University, of 
Pedantry, and of Philosophy” 


James Clerk Maxwell was a very smart man. Born in Edinburgh, he was a student, 
first in his hometown, and later in Cambridge, at Peterhouse and then at Trinity. 
He held faculty positions at the University of Aberdeen (where they fired him) and 
Kings College London before returning to Cambridge as the first Cavendish professor 
of physics. 


Perhaps the first very smart thing that Maxwell did 
was to determine the composition of Saturn’s rings. He H+ & 
didn’t do this using a telescope. He did it using mathe- F - RÉ ms è 
matics! He showed that neither a solid nor a fluid ring 


[a 


could be stable. Such rings could only be made of many "d ee ee 
small particles. For this he was awarded the Adams Prize. ai pe H y H =f 


(These days you can win this prize for much much less!) E 
Maxwell’s great work on electromagnetism was accom- 

plished between 1861 and 1862. He started by construct- Figure 46: Maxwell’s vor- 

ing an elaborate mechanical model of electricity and mag- tices 

netism in which space is filled by vortices of an incom- 

pressible fluid, separated by tiny rotating particles that give rise to electricity. One of 


his illustrations is shown above. Needless to say, we don’t teach this picture of space 
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anymore. From this, he managed to distill everything that was known about electro- 
magnetism into 20 coupled equations in 20 variables. This was the framework in which 
he discovered the displacement current and its consequences for light. 


You might think that the world changed when Maxwell published his work. In 
fact, no one cared. The equations were too hard for physicists, the physics too hard 
for mathematicians. Things improved marginally in 1873 when Maxwell reduced his 
equations to just four, albeit written in quaternion notation. The modern version 
of Maxwell equations, written in vector calculus notation, is due to Oliver Heaviside 
in 1881. In all, it took almost 30 years for people to appreciate the significance of 
Maxwell’s acheivement. 


Maxwell made a number of other important contributions to science, including the 
first theory of colour vision and the theory of colour photography. His work on ther- 
modynamics and statistical mechanics deserves at least equal status with his work 
on electromagnetism. He was the first to understand the distribution of velocities of 
molecules in a gas, the first to extract an experimental prediction from the theory of 
atoms and, remarkably, the first (with the help of his wife) to build the experiment and 
do the measurement, confirming his own theory. 


4.4 Transport of Energy: The Poynting Vector 


Electromagnetic waves carry energy. This is an important fact: we get most of our 
energy from the light of the Sun. Here we’d like to understand how to calculate this 
energy. 


Our starting point is the expression (4.3) for the energy stored in electric and mag- 
netic fields, 


1 
U= fas (fu-n+ B-B) 
v 2 2/40 


The expression in brackets is the energy density. Here we have integrated this only 
over some finite volume V rather than over all of space. This is because we want to 
understand the way in which energy can leave this volume. We do this by calculating 


dU dE 1. ôB 
See d E. i f 
dt I i (« a =) 


= f dx (E.V xB)-E-3-ŻB.(VxE)) 


— 92 — 


where we’ve used the two Maxwell equations. Now we use the identity 
E-(V x B)-—B-(V x E) =-V.-(E~xB) 


and write 


dU 1 
dU de J-B-— [ (Bx B)-aS 4.20 
dt 7 Lo A ) l ) 


where we’ve used the divergence theorem to write the last term. This equation is 
sometimes called the Poynting theorem. 


The first term on the right-hand side is related to something that we’ve already seen 
in the context of Newtonian mechanics. The work done on a particle of charge q moving 
with velocity v for time ôt in an electric field is ÂW = qv -E ôt. The integral fy d'z J-E 
above is simply the generalisation of this to currents: it should be thought of as the 
rate of gain of energy of the particles in the region V. Since it appears with a minus 
sign in (4.20), it is the rate of loss of energy of the particles. 


Now we can interpret (4.20). If we write it as 


dU . 1 
Gt | teI B=- (E x B)-dS 

dt Jy Ho Js 

then the left-hand side is the combined change in energy of both fields and particles in 
region V. Since energy is conserved, the right-hand side must describe the energy that 
escapes through the surface S' of region V. We define the Poynting vector 


S= 2 ExB 
Ho 
This is a vector field. It tells us the magnitude and direction of the flow of energy in any 
point in space. (It is unfortunate that the canonical name for the Poynting vector is S 
because it makes it notationally difficult to integrate over a surface which we usually 
also like to call S. Needless to say, these two things are not the same and hopefully no 
confusion will arise). 


Let’s now look at the energy carried in electromagnetic waves. Because the Poynting 
vector is quadratic in E and B, we’re not allowed to use the complex form of the waves. 
We need to revert to the real form. For linear polarisation, we write the solutions in 
the form (4.17), but with arbitrary wavevector k, 


on 
E = Eosin(k-x— wt) and B= -(k x Eo)sin(k-x-— wt) 
c 
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The Poynting vector is then 


E + 
S = — k sin?(k - x — wt) 


CHo 
Averaging over a period, T = 2r /w, we have 
eee 
2CH0 


We learn that the electromagnetic wave does indeed transport energy in its direction of 
propagation k. It’s instructive to compare this to the energy density of the field (4.3). 
Evaluated on the electromagnetic wave, the energy density is 
1 
u = E. E+— B-B = eE?sin?(k -x -— wt) 
2 2o 


Averaged over a period T = 27/w, this is 


2 
2 
Then, using c? = 1/eouo,we can write 
S = cuk 


The interpretation is simply that the energy S is equal to the energy density in the 
wave u times the speed of the wave, c. 


4.4.1 The Continuity Equation Revisited 


Recall that, way back in Section 1, we introduced the continuity equation for electric 
charge, 


This equation is not special to electric charge. It must hold for any quantity that is 
locally conserved. 


Now we have encountered another quantity that is locally conserved: energy. In the 
context of Newtonian mechanics, we are used to thinking of energy as a single number. 
Now, in field theory, it is better to think of energy density €(x,t). This includes the 
energy in both fields and the energy in particles. Thinking in this way, we notice that 
(4.20) is simply the integrated version of a continuity equation for energy. We could 
equally well write it as 

oE 

OE +V-S=0 
We see that the Poynting vector S is to energy what the current J is to charge. We’ll 
explore this connection further in Section 5.6. 
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5. Electromagnetism and Relativity 


We've seen that Maxwell’s equations have wave solutions which travel at the speed 
of light. But there’s another place in physics where the speed of light plays a promi- 
nent role: the theory of special relativity. How does electromagnetism fit with special 
relativity? 


Historically, the Maxwell equations were discovered before the theory of special rel- 
ativity. It was thought that the light waves we derived above must be oscillations of 
some substance which fills all of space. This was dubbed the aether. The idea was that 
Maxwell’s equations only hold in the frame in which the aether is at rest; light should 
then travel at speed c relative to the aether. 


We now know that the concept of the aether is unnecessary baggage. Instead, 
Maxwell’s equations hold in all inertial frames and are the first equations of physics 
which are consistent with the laws of special relativity. Ultimately, it was by studying 
the Maxwell equations that Lorentz was able to determine the form of the Lorentz 
transformations which subsequently laid the foundation for Einstein’s vision of space 
and time. 


Our goal in this section is to view electromagnetism through the lens of relativity. 
We will find that observers in different frames will disagree on what they call electric 
fields and what they call magnetic fields. They will observe different charge densities 
and different currents. But all will agree that these quantities are related by the same 
Maxwell equations. Moreover, there is a pay-off to this. It’s only when we formulate 
the Maxwell equations in a way which is manifestly consistent with relativity that we 
see their true beauty. The slightly cumbersome vector calculus equations that we’ve 
been playing with throughout these lectures will be replaced by a much more elegant 
and simple-looking set of equations. 


5.1 A Review of Special Relativity 


We start with a very quick review of the relevant concepts of special relativity. (For 
more details see the lecture notes on Dynamics and Relativity). The basic postulate of 
relativity is that the laws of physics are the same in all inertial reference frames. The 
guts of the theory tell us how things look to observers who are moving relative to each 
other. 


The first observer sits in an inertial frame S with spacetime coordinates (ct, x, y, z) 
the second observer sits in an inertial frame S’ with spacetime coordinates (ct’, x’, y’, 2’). 
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If we take S’ to be moving with speed v in the z-direction relative to S then the 
coordinate systems are related by the Lorentz boost 


rt =y (2 as ct) and =y (ct — -°) (5.1) 
c 
while y’ = y and 2’ = z. Here c is the speed of light which has the value, 
c = 299792458 ms* 


Meanwhile y is the ubiquitous factor 


= 02 


The Lorentz transformation (5.1) encodes within it all of the fun ideas of time dilation 
and length contraction that we saw in our first course on relativity. 


5.1.1 Four-Vectors 


It’s extremely useful to package these spacetime coordinates in 4-vectors, with indices 
running from u = 0 tou =3 


AY = (tuz) je—0,1,2,3 


Note that the index is a superscript rather than subscript. This will be important 
shortly. A general Lorentz transformation is a linear map from X to X’ of the form 


(X) = MX" 
Here A is a 4 x 4 matrix which obeys the matrix equation 
MAnk=n © AP Noo, = Nw (5.3) 
with nv the Minkowski metric 
nN = diag(+1, —1, —1, —1) 


The solutions to (5.3) fall into two classes. The first class is simply rotations. Given a 
3 x 3 rotation matrix R obeying RTR = 1, we can construct a Lorentz transformation 
A obeying (5.3) by embedding R in the spatial part, 


0 0 0 


AM, = (5.4) 


R 


© © O]e 


These transformations describe how to relate the coordinates of two observers who are 
rotated with respect to each other. 
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The other class of solutions to (5.3) are the Lorentz boosts. These are the transfor- 
mations appropriate for observers moving relative to each other. The Lorentz transfor- 
mation (5.1) is equivalent to 


y —yv/c| O0 0 
—yu/c 0 0 
gee (ese cea (5.5) 
0 0 1 0 
0 0 0 1 


There are similar solutions associated to boosts along the y and z axes. 


The beauty of 4-vectors is that it’s extremely easy to write down invariant quantities. 
These are things which all observers, no matter which their reference frame, can agree 
on. To construct these we take the inner product of two 4-vectors. The trick is that 
this inner product uses the Minkowski metric and so comes with some minus signs. For 
example, the square of the distance from the origin to some point in spacetime labelled 
by X is 


X-X = X'h X = 7? — 2? —y? — 2 


which is the invariant interval. Similarly, if we’re given two four-vectors X and Y then 
the inner product X -Y = X"7,,Y” is also a Lorentz invariant. 


5.1.2 Proper Time 


The key to building relativistic theories of Nature is to find the variables that have 
nice properties under Lorentz transformations. The 4-vectors X, labelling spacetime 
points, are a good start. But we need more. Here we review how the other kinematical 
variables of velocity, momentum and acceleration fit into 4-vectors. 


Suppose that, in some frame, the particle traces out a worldline. The clever trick is to 
find a way to parameterise this path in a way that all observers agree upon. The natural 
choice is the proper time T, the duration of time experienced by the particle itself. 
If youre sitting in some frame, watching some particle move with an old-fashioned 
Newtonian 3-velocity u(t), then it’s simple to show that the relationship between your 
time t and the proper time of the particle T is given by 


dt 


T = 7(u) 


2/97 = 


The proper time allows us to define the 4-velocity and the 4-momentum. Suppose that 
the particle traces out a path X(T) in some frame. Then the 4-velocity is 


Similarly, the 4-momentum is P = mU where m is the rest mass of the particle. We 


Pe i (5.6) 
p 


where E = myc? is the energy of the particle and p = ymu is the 3-momentum in 


write 


special relativity. 


The importance of U and P is that they too are 4-vectors. Because all observers 
agree on 7, the transformation law of U and P are inherited from X. This means that 
under a Lorentz transformation, they too change as U — AU and P — AP. And it 
means that inner products of U and P are guaranteed to be Lorentz invariant. 


5.1.3 Indices Up, Indices Down 


Before we move on, we do need to introduce one extra notational novelty. The minus 
signs in the Minkowski metric ņn means that it’s useful to introduce a slight twist to 
the usual summation convention of repeated indices. For all the 4-vectors that we 
introduced above, we always place the spacetime index u = 0,1,2,3 as a superscript 


yu ct 
x 
This is because the same object with an index down, X,,, will mean something subtly 


different. We define 
ct 
—x 


With this convention, the Minkowski inner product can be written using the usual 


(i.e. up) rather than a subscript. 


convention of summing over repeated indices as 
XPXy = Ct —x-x 


In contrast, X“X" = t? + x? is a dumb thing to write in the context of special 
relativity since it looks very different to observers in different inertial frames. In fact, 
we will shortly declare it illegal to write things like X” X”. 
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There is a natural way to think of X, in terms of X” using the Minkowski metric 
Nu = diag(+1,—1,—1,—-1). The following equation is trivially true: 


Xp = Nu X” 


This means that we can think of the Minkowski metric as allowing us to lower indices. 
To raise indices back up, we need the inverse of n,» which, fortunately, is the same 
matrix: n’” = diag(+1,—1,—1,—1) which means we have n“?n,, = 64%, and we can 
write 


X” = "X, 


From now on, we’re going to retain this distinction between all upper and lower indices. 
All the four-vectors that we’ve met so far have upper indices. But all can be lowered 
in the same way. For example, we have 


U= & (5.7) 


This trick of distinguishing between indices up and indices down provides a simple 
formalism to ensure that all objects have nice transformation properties under the 
Lorentz group. We insist that, just as in the usual summation convention, repeated 
indices only ever appear in pairs. But now we further insist that pairs always appear 
with one index up and the other down. The result will be an object which is invariant 
under Lorentz transformations. 


5.1.4 Vectors, Covectors and Tensors 


In future courses, you will learn that there is somewhat deeper mathematics lying be- 
hind distinguishing X“ and X,,: formally, these objects live in different spaces (some- 
times called dual spaces). We’ll continue to refer to X” as vectors, but to distinguish 
them, we'll call X,, covectors. (In slightly fancier language, the components of the vec- 
tor X are sometimes said to be contravariant while the components of the covector 
X,, are said to be covariant). 


For now, the primary difference between a vector and covector is how they transform 
under rotations and boosts. We know that, under a Lorentz transformation, any 4- 
vector changes as 


X" > X'! = AH XY (5.8) 
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From this, we see that a covector should transform as 


Xp X, = Nupp X” 
= Np? X? 
= Npp A.o Xy 


Using our rule for raising and lowering indices, now applied to the Lorentz transforma- 
tion A, we can also write this as 


Xa > Ar Xv 


where our notation is now getting dangerously subtle: you have to stare to see whether 
the upper or lower index on the Lorentz transformation comes first. 


There is a sense in which A,” can be thought of a the components of the inverse 
matrix A~!. To see this, we go back to the definition of the Lorentz transformation 
(5.3), and start to use our new rules for raising and lowering indices 


Mi Noo, = Tw => A Nov = Nw 
> MAP =ð, 
=> APM, = 0% 


In the last line above, we’ve simply reversed the order of the two terms on the left. 
(When written in index notation, these are just the entries of the matrix so there’s no 
problem with commuting them). Now we compare this to the formula for the inverse 
of a matrix, 


Cs ee ô, > AT =A,’ (5.9) 


Note that you need to be careful where you place the indices in equations like this. 
The result (5.9) is analogous to the statement that the inverse of a rotation matrix is 
the transpose matrix. For general Lorentz transformations, we learn that the inverse 
is sort of the transpose where “sort of? means that there are minus signs from raising 
and lowering. The placement of indices in (5.9) tells us where those minus signs go. 


The upshot of (5.9) is that if we want to abandon index notation all together then 
vectors transform as X — AX while covectors — which, for the purpose of this sentence, 
we'll call X — transform as X — A~!X. However, in what follows, we have no intention 
of abandoning index notation. Instead, we will embrace it. It will be our friend and 
our guide in showing that the Maxwell equations are consistent with special relativity. 
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A particularly useful example of a covector is the four-derivative. This is the rela- 
tivistic generalisation of V, defined by 


o 10 
a= sa = (iiy) 


Notice that the superscript on the spacetime 4-vector X” has migrated to a subscript 
on the derivative 0,,. For this to make notational sense, we should check that 0,, does 
indeed transform as covector. This is a simple application of the chain rule. Under a 
Lorentz transformation, X” > X'" = A#, X”, so we have 
o o OX” O 
On — —> => 

OXF OX'H  OX'HOXY 

which is indeed the transformation of a co-vector. 


= (Ar O = AO, 


Tensors 


Vectors and covectors are the simplest examples of objects which have nice transfor- 
mation properties under the Lorentz group. But there are many more examples. The 
most general object can have a bunch of upper indices and a bunch of lower indices, 
Tin y,- These objects are also called tensors of type (n,m). In order to qualify 
as a tensor, they must transform under a Lorentz transformation as 


TlH Hn — AML Abn A PL N om TP P1--Pn 
Pl Pn V1 Vm 


V1.. Um 


(5.10) 


O1...Om 


You can always use the Minkowski metric to raise and lower indices on tensors, changing 
the type of tensor but keeping the total number of indices n + m fixed. 


Tensors of this kind are the building blocks of all our theories. This is because if you 
build equations only out of tensors which transform in this manner then, as long as 
the u,v,... indices match up on both sides of the equation, you’re guaranteed to have 
an equation that looks the same in all inertial frames. Such equations are said to be 
covariant. You'll see more of this kind of thing in courses on General Relativity and 
Differential Geometry. 


In some sense, this index notation is too good. Remember all those wonderful things 
that you first learned about in special relativity: time dilation and length contraction 
and twins and spaceships so on. You’ll never have to worry about those again. From 
now on, you can guarantee that you’re working with a theory consistent with relativity 
by ensuring two simple things 


e That you only deal with tensors. 


e That the indices match up on both sides of the equation. 


It’s sad, but true. It’s all part of growing up and not having fun anymore. 
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5.2 Conserved Currents 


We started these lectures by discussing the charge density p(x,t), the current density 
J(x,t) and their relation through the continuity equation, 


which tells us that charge is locally conserved. 


The continuity equation is already fully consistent with relativity. To see this, we 
first need to appreciate that the charge and current densities sit nicely together in a 


(5 


Of course, placing objects in a four-vector has consequence: it tells us how these objects 
look to different observers. Let’s quickly convince ourselves that it makes sense that 


4-vector, 


charge density and current do indeed transform in this way. We can start by considering 
a situation where there are only static charges with density pọ and no current. So 
J" = (po,0). Now, in a frame that is boosted by velocity v, the current will appear as 
J'# = AK J” with the Lorentz transformation given by (5.5). The new charge density 
and current are then 


p =p , J = -yov 


The first of these equations tells us that different observers see different charge densities. 
This is because of Lorentz contraction: charge density means charge per unit volume. 
And the volume gets squeezed because lengths parallel to the motion undergo Lorentz 
contraction. That’s the reason for the factor of y in the observed charge density. 
Meanwhile, the second of these equations is just the relativistic extension of the formula 
J = pv that we first saw in the introduction. (The extra minus sign is because v here 
denotes the velocity of the boosted observer; the charge is therefore moving with relative 
velocity —v). 


In our new, relativistic, notation, the continuity equation takes the particularly sim- 
ple form 


ðJ” =0 (5.11) 


This equation is Lorentz invariant. This follows simply because the indices are con- 
tracted in the right way: one up, and one down. 
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5.2.1 Magnetism and Relativity 


We've learned something unsurprising: boosted charge gives rise to a current. But, 
combined with our previous knowledge, this tells us something new and important: 
boosted electric fields must give rise to magnetic fields. The rest of this chapter will 
be devoted to understanding the details of how this happens. But first, we’re going to 
look at a simple example where we can re-derive the magnetic force purely from the 
Coulomb force and a dose of Lorentz contraction. 


To start, consider a bunch of positive charges eu 
+q moving along a line with speed +v and a bunch of 
i x : è i : e 3) e e © 
negative charges —q moving in the opposite direction o é ė 
: : . o ' 
with speed —v as shown in the figure. If there is equal = ae e o ° 
density, n, of positive and negative charges then the = y 
charge density vanishes while the current is 
Figure 47: 


I = 2n Aqu 


where A is the cross-sectional area of the wire. Now consider a test particle, also 
carrying charge q, which is moving parallel to the wire with some speed u. It doesn’t 
feel any electric force because the wire is neutral, but we know it experiences a magnetic 
force. Here we will show how to find an expression for this force without ever invoking 
the phenomenon of magnetism. 


The trick is to move to the rest frame of the test particle. This means we have to 
boost by speed u. The usual addition formula tells us that the velocities of the positive 
and negative charges now differ, given by 


_ vẸFu 
— 1F w/e 


But with the boost comes a Lorentz contraction which means that the charge density 
changes. Moreover, because the velocities of positive and negative charges are now 
different, this will mean that, viewed from the rest frame of our particle, the wire is 
no longer neutral. Let’s see how this works. First, we’ll introduce ng, the density of 
charges when the particles in the wire are at rest. Then the density of the +q charges 
in the original frame is 


p = qn = y(v)qno 


The charge density of the —q particles is the same, but with opposite sign, so that in 
the original frame the wire is neutral. However, in our new frame, the charge densities 
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are 


p = qna = qy(v+)no = (2 F =) y(u)y(v) qno 


where you’ve got to do a little bit of algebra to get to the last result. Since v_ > v4, 
we have n_ > n, and the wire carries negative charge. The overall net charge density 
in the new frame is 


p = qn'=q(n, —n_) = —— yu) qn 


But we know that a line of electric charge creates an electric field; we calculated it in 
(2.6); it is 
2uv y(u) qnA 
AS a(u) na . 
È 2TEor 
where r is the radial direction away from the wire. This means that, in its rest frame, 
the particle experiences a force 
nAq?v 
F' = —uy(u) —-— 
L a 
where the minus sign tells us that the force is towards the wire for u > 0. But if there’s 
a force in one frame, there must also be a force in another. Transforming back to where 
we came from, we conclude that even when the wire is neutral there has to be a force 


F= = —u = —ug (5.12) 


But this precisely agrees with the Lorentz force law, with the magnetic field given by 
the expression (3.5) that we computed for a straight wire. Notice that if u > 0 then 
the test particle — which has charge q — is moving in the same direction as the particles 
in the wire which have charge q and the force is attractive. If u < 0 then it moves in 
the opposite direction and the force is repulsive. 


This analysis provides an explicit demonstration of how an electric force in one frame 
of reference is interpreted as a magnetic force in another. There’s also something rather 
surprising about the result. We’re used to thinking of length contraction as an exotic 
result which is only important when we approach the speed of light. Yet the electrons 
in a wire crawl along. They take around an hour to travel a meter! Nonetheless, we 
can easily detect the magnetic force between two wires which, as we’ve seen above, can 
be directly attributed to the length contraction in the electron density. 
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The discussion above needs a minor alteration for actual wires. In the rest frame of 
the wire the positive charges — which are ions, atoms stripped of some of their electrons 
— are stationary while the electrons move. Following the explanation above, you might 
think that there is an imbalance of charge density already in this frame. But that’s not 
correct. The current is due to some battery feeding electrons into the wire and taking 
them out the other end. And this is done in such a way that the wire is neutral in the 
rest frame, with the electron density exactly compensating the ion density. In contrast, 
if we moved to a frame in which the ions and electrons had equal and opposite speeds, 
the wire would appear charged. Although the starting point is slightly different, the 
end result remains. 


5.3 Gauge Potentials and the Electromagnetic Tensor 


Under Lorentz transformations, electric and magnetic fields will transform into each 
other. In this section, we want to understand more precisely how this happens. At 
first sight, it looks as if it’s going to be tricky. So far the objects which have nice 
transformation properties under Lorentz transformations are 4-vectors. But here we’ve 
got two 3-vectors, E and B. How do we make those transform into each other? 


5.3.1 Gauge Invariance and Relativity 


To get an idea for how this happens, we first turn to some objects that we met previ- 
ously: the scalar and vector potentials @ and A. Recall that we introduced these to 
solve some of the equations of electrostatics and magnetostatics, 


VxE=0 3 E=-V¢ 
V-B=0 = B=VxA 


However, in general these expressions can’t be correct. We know that when B and E 
change with time, the two source-free Maxwell equations are 


vVxE+ =0 and V-B=0 


Nonetheless, it’s still possible to use the scalar and vector potentials to solve both of 
these equations. The solutions are 
OA 
PRENES ae and B=VxA 


where now @ = ọ(x, t) and A = A (x,t). 
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Just as we saw before, there is no unique choice of ø and A. We can always shift 
A — A + Vy and B remains unchanged. However, now this requires a compensating 
shift of ¢. 


og & and A>A++Vy (5.13) 


with x = x(x, t). These are gauge transformations. They reproduce our earlier gauge 
transformation for A, while also encompassing constant shifts in œ. 


How does this help with our attempt to reformulate electromagnetism in a way 
compatible with special relativity? Well, now we have a scalar, and a 3-vector: these 
are ripe to place in a 4-vector. We define 


(2) 
A 


Or, equivalently, A, = (¢/c, A). In this language, the gauge transformations (5.13) 
take a particularly nice form, 


Ay, > Ay — pX (5.14) 
where y is any function of space and time 


5.3.2 The Electromagnetic Tensor 


We now have all the ingredients necessary to determine how the electric and magnetic 
fields transform. From the 4-derivative ô, = (0/O(ct),V) and the 4-vector A, = 
(o/c, -A), we can form the anti-symmetric tensor 


Pip = Oy = Opa 
This is constructed to be invariant under gauge transformations (5.14). We have 
Fo r FoF One = 0. =f iw 


This already suggests that the components involve the E and B fields. To check that 
this is indeed the case, we can do a few small computations, 
10(-Az) _ l/c) _ Ez 


F = => 
Oe ôt Ox c 


and 
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Similar computations for all other entries give us a matrix of electric and magnetic 


fields, 


0  Es/e E/ce E;/c 
— Ey 0 -B, B 
Fy = fe y (5.15) 
—E,/c B; 0 —-B, 


—E,/c —B, B, 0 


This, then, is the answer to our original question. You can make a Lorentz covariant 
object consisting of two 3-vectors by arranging them in an anti-symmetric tensor. Fuy 
is called the electromagnetic tensor. Equivalently, we can raise both indices using the 
Minkowski metric to get 


0 —E,/e —E,/e —E,/c 
E,/c 0 —B, B, 
E,/c B, 0 —B, 
B./c -B, B, 0 


F” = phy’? Fu = 


Both F,» and F*” are tensors. They are tensors because they’re constructed out of 
objects, A,, O,, and 7,,, which themselves transform nicely under the Lorentz group. 
This means that the field strength must transform as 


pK T“ (5.16) 


Alternatively, if you want to get rid of the indices, this reads F” = AFAT. The observer 
in a new frame sees electric and magnetic fields E’ and B’ that differ from the original 
observer. The two are related by (5.16). Let’s look at what this means in a couple of 
illustrative examples. 


Rotations 


To compute the transformation (5.16), it’s probably simplest to just do the sums that 
are implicit in the repeated p and o labels. Alternatively, if you want to revert to 
matrix multiplication then this is the same as F’ = AFA’. Either way, we get the 
same result. For a rotation, the 3 x 3 matrix R is embedded in the lower-right hand 
block of A as shown in (5.4). A quick calculation shows that the transformation of the 
electric and magnetic fields in (5.16) is as expected, 


E’=RE and B’/=RB 
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Boosts 


Things are more interesting for boosts. Let’s consider a boost v in the x-direction, with 
A given by (5.5). Again, you need to do a few short calculations. For example, we have 


J = pre panis A? Al Fee 
Cc ai 
2 AP At F” + ACA F” 
o PEs Es Es 
= ce L co c 
and 
E — Fi = ee ad 
Cc 
= AD Ae + ALAF” 
E 
Se a a) 
c c 
and 


-B' = Fil = AC ie Fee 
_ Ag ge + Ahr 
YU 
= ay — yB, = —7(Bz — vE,/c’) 


The final result for the transformation of the electric field after a boost in the x-direction 
is 

E', = y(Ey — vB.) (5.17) 
and, for the magnetic field, 


By = 7 (By + 5B) (5.18) 


As we anticipated above, what appears to be a magnetic field to one observer looks like 
an electric field to another, and vice versa. 
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Note that in the limit v < c, we have E’ = E+ v x B and B’ = B. This can be 
thought of as the Galilean boost of electric and magnetic fields. We recognise E+v x B 
as the combination that appears in the Lorentz force law. We’ll return to this force in 
Section 5.4.1 where we'll see how it’s compatible with special relativity. 


5.3.3 An Example: A Boosted Line Charge 


In Section 2.1.3, we computed the electric field due to a line with uniform charge density 
7 per unit length. If we take the line to lie along the x-axis, we have (2.6) 


, 0 
E= mee (o) (5.19) 


Meanwhile, the magnetic field vanishes for static electric charges: B = 0. Let’s see 
what this looks like from the perspective of an observer moving with speed v in the 
x-direction, parallel to the wire. In the moving frame the electric and magnetic fields 
are given by (5.17) and (5.18). These read 


i i _ ny y 
2reoly? + 2? Qreg(y!? + z”? / 
z z 


0 0 
nyv nyu 
B’ = = ' 5.20 
2reoc? (y? + 2?) (2) 2reoc? (y? + 2/2) (2) (a20 


In the second equality, we’ve rewritten the expression in terms of the coordinates of S’ 


which, because the boost is in the x-direction, are trivial: y = y’ and z = z’. 


From the perspective of an observer in frame S’, the charge density in the wire is 
7! = yn, where the factor of y comes from Lorentz contraction. This can be seen in 
the expression above for the electric field. Since the charge density is now moving, the 
observer in frame S’ sees a current T” = —ynv. Then we can rewrite (5.20) as 


B’ _ Hol’ Com 
2m fy! ? + 2! 


But this is something that we’ve seen before. It’s the magnetic field due to a current 


(5.21) 


in a wire (3.5). We computed this in Section 3.1.1 using Ampére’s law. But here we’ve 
re-derived the same result without ever mentioning Ampére’s law! Instead, our starting 
point (5.19) needed Gauss’ law and we then used only the Lorentz transformation of 
electric and magnetic fields. We can only conclude that, under a Lorentz transforma- 
tion, Gauss’ law must be related to Ampére’s law. Indeed, we’ll shortly see explicitly 
that this is the case. For now, it’s worth repeating the lesson that we learned in Section 
5.2.1: the magnetic field can be viewed as a relativistic effect. 


=o 


5.3.4 Another Example: A Boosted Point Charge 
Consider a point charge Q, stationary in an inertial frame S. We know that it’s electric 


field is given by 


E Q 
Aregr? Ares? + y? + 2798/2 (3 


while its magnetic field vanishes. Now let’s look at this same particle from the frame 
S’, moving with velocity v = (v,0,0) with respect to S. The Lorentz boost which 
relates the two is given by (5.5) and so the new electric field are given by (5.17), 


Q x 
E= 4 24424 52/372 |Y 
Meg ee +y? + 2°] ~z 
But this is still expressed in terms of the original coordinates. We should now rewrite 


this in terms of the coordinates of S’, which are x’ = y(x — vt) and y’ = y and 2’ = z. 
Inverting these, we have 


E Q x’ +t! 
y 
' — / 
Are |y? (a! + vt)? + yl? + z?) Y (5.22) 


In the frame S’, the particle sits at x’ = (—vt’,0,0), so we see that the electric field 


emanates from the position of the charge, as it should. For now, let’s look at the electric 
field when t = 0 so that the particle sits at the origin in the new frame. The electric 
field points outwards radially, along the direction 


However, the electric field is not isotropic. This arises from the denominator of (5.22) 
which is not proportional to r’? because there’s an extra factor of y? in front of the x’ 
component. Instead, at t = 0, the denominator involves the combination 


npg yl gl! — (7? — 1)? + r? 


2m2 
Uy 
q? r? 
Cc 


2.2 
=a ( Y cos? + 1) r? 


c2 


ae iet no 12 
= "y a r 
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Figure 48: The isotropic field lines of a Figure 49: The squeezed field lines of a 
static charge moving charge 


where the @ is the angle between r’ and the ’-axis and, in the last line, we’ve just used 
some simple trig and the definition of y? = 1/(1 — v?/c?). This means that we can 
write the electric field in frame S” as 


/ 1 Q ^l 

E = f 
92(1 — v? sin? 0 /c2)3/2 4reor’? 

The pre-factor is responsible for the fact that the electric field is not isotropic. We see 

that it reduces the electric field along the z’-axis (i.e when 0 = 0) and increases the field 


along the perpendicular y’ and 2’ axes (i.e. when 6 = 7/2). This can be thought of as 
a consequence of Lorentz contraction, squeezing the electric field lines in the direction 
of travel. 


The moving particle also gives rise to a magnetic field. This is easily computed using 
the Lorentz transformations (5.18). It is 


B = HoQyu : 
Any? (a! vt)? l y’? l aa —y' 


5.3.5 Lorentz Scalars 


We can now ask a familiar question: is there any combination of the electric and 
magnetic fields that all observers agree upon? Now we have the power of index notation 
at our disposal, this is easy to answer. We just need to write down an object that doesn’t 
have any floating u or v indices. Unfortunately, we don’t get to use the obvious choice 
of Nf because this vanishes on account of the anti-symmetry of F“”. The simplest 
thing we can write down is 


1 E? 
Jae E B? (5.23) 


Note the relative minus sign between E and B, mirroring a similar minus sign in the 
spacetime interval. 


== 


However, this isn’t the only Lorentz scalar that we can construct from E and B. 
There is another, somewhat more subtle, object. To build this, we need to appreciate 
that Minkowski spacetime comes equipped with another natural tensor object, beyond 
the familiar metric nav. This is the fully anti-symmetric object known as the alternating 
tensor, 


ewoo +1 if uvpo is an even permutation of 0123 
—1 if uvpo is an odd permutation of 0123 


while «“”?? = 0 if there are any repeated indices. 


To see why this is a natural object in Minkowski space, let’s look at how it changes 
under Lorentz transformations. The usual tensor transformation is 


e! Vp __ BARN een se 


It’s simple to check that ¢’””? is also full anti-symmetric; it inherits this property from 
e408 on the right-hand side. But this means that ¢’“”°? must be proportional to €#”?”. 
We only need to determine the constant of proportionality. To do this, we can look at 


¢/ 0123 = AMR AER = det (A) 


Now any Lorentz transformations have det(A) = +1. Those with det(A) = 1 make 
up the “proper Lorentz group” SO(1,3). (This was covered in the Dynamics and 
Relativity notes). These proper Lorentz transformations do not include reflections or 
time reversal. We learn that the alternating tensor ¢””?? is invariant under proper 
Lorentz transformations. What it’s really telling us is that Minkowski space comes 
with an oriented orthonormal basis. By lowering indices with the Minkowski metric, 
we can also construct the tensor €),,¢ which has €9123 = —1. 


The alternating tensor allows us to construct a second tensor field, sometimes called 
the dual electromagnetic tensor (although “dual” is perhaps the most overused word in 
physics), 

O Be -B, -B: 
B, 0  E,/e —E,/c 
B, -E,/c 0 Pgh 
B, E,/ce —Es/c 0 


(5.24) 


~ 1 
Pw = e Figg = 


F¥” is sometimes also written as *F'"”. We see that this is looks just like F’” but with 
the electric and magnetic fields swapped around. Actually, looking closely you'll see 
that there’s a minus sign difference as well: F”” arises from F“” by the substitution 
E > cB and B > -E/c. 
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The statement that F“” is a tensor means that it too has nice properties under 
Lorentz transformations, 


frm — Ae Ay foo 


and we can use this to build new Lorentz invariant quantities. Taking the obvious 
square of F doesn’t give us anything new, since 


PY PR yg =F Fiw 


But by contracting F with the original F we do find a new Lorentz invariant 
1 ~ 1 
-FF =—--E-B 5.25 
4 H c ( ) 


This tells us that the inner-product of E and B is the same viewed in all frames. 


5.4 Maxwell Equations 


We now have the machinery to write the Maxwell equations in a way which is manifestly 
compatible with special relativity. They take a particularly simple form: 


ð F” = pio J” and ô, F” =0 (5.26) 
Pretty aren’t they! 


The Maxwell equations are not invariant under Lorentz transformations. This is 
because there is the dangling v index on both sides. However, because the equations 
are built out of objects which transform nicely — F#”, F", J” and O, — the equations 
themselves also transform nicely. For example, we will see shortly that Gauss’ law 
transforms into Ampére’s law under a Lorentz boost, something we anticipated in 
Section 5.3.3. We say that the equations are covariant under Lorentz transformations. 


This means that an observer in a different frame will mix everything up: space 
and time, charges and currents, and electric and magnetic fields. Although observers 
disagree on what these things are, they all agree on how they fit together. This is what 
it means for an equation to be covariant: the ingredients change, but the relationship 
between them stays the same. All observers agree that, in their frame, the electric and 
magnetic fields are governed by the same Maxwell equations. 


Given the objects F#”, FH”, J” and u, the Maxwell equations are not the only 
thing you could write down compatible with Lorentz invariance. But they are by 
far the simplest . Any other equation would be non-linear in F or F or contain more 
derivative terms or some such thing. Of course, simplicity is no guarantee that equations 
are correct. For this we need experiment. But surprisingly often in physics we find 
that the simplest equations are also the right ones. 
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Unpacking the Maxwell Equations 


Let’s now check that the Maxwell equations (5.26) in relativistic form do indeed coincide 
with the vector calculus equations that we’ve been studying in this course. We just 
need to expand the different parts of the equation. The components of the first Maxwell 
equation give 


iran = vense 
€0 


a 1 ðE 
ð F” = uoJ = “2g yV Pem 


In the first equation, which arises from v = 0, we sum only over spatial indices ¿į = 1, 2, 3 
because F°° = 0. Meanwhile the components of the second Maxwell equation give 
=): = y-B=0 


~ OB 
Oye = 0 > go are 


These, of course, are the familiar equations that we’ve all grown to love over this course. 


Here a few further, simple comments about the advantages of writing the Maxwell 
equations in relativistic form. First, the Maxwell equations imply that current is con- 
served. This follows because F*”” is anti-symmetric, so 0,0,F"” = 0 automatically, 
simply because 0,0, is symmetric. The first of the Maxwell equations (5.26) then 
requires that the continuity equation holds 


ðJ” =0 


This is the same calculation that we did in vector notation in Section 4.2.1. Note that 
it’s marginally easier in the relativistic framework. 


The second Maxwell equation can be written in a number of different ways. It is 
equivalent to: 


OF! =0 & POF, =0 & OF tOFytOuFip=0 (5.27) 
where the last of these equalities follows because the equation is constructed so that it 


is fully anti-symmetric with respect to exchanging any of the indices p, u and v. (Just 
expand out for a few examples to see this). 
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The gauge potential A,, was originally introduced to solve the two Maxwell equations 
which are contained in 0, F“” = 0. Again, this is marginally easier to see in relativistic 
notation. If we write Fi, = 0, A, — 0, A, then 


7 1 1 
ð F” = 50 Out pn — gE nlp Ac — 3sAp) = 0 


where the final equality holds because of the symmetry of the two derivatives, combined 
with the anti-symmetry of the e-tensor. This means that we could equally well write 
the Maxwell equations as 


o, F"” = uoJ” where Fw = ô Ay — 0, A, (5.28) 


The first of these coincides with the first equation in (5.26); the second is an alternative 
way of writing the second equation in (5.26). In more advanced formulations of elec- 
tromagnetism (for example, in the Lagrangian formulation), this is the form in which 
the Maxwell equations arise. 


5.4.1 The Lorentz Force Law 


There’s one last aspect of electromagnetism that we need to show is compatible with 
relativity: the Lorentz force law. In the Newtonian world, the equation of motion for 
a particle moving with velocity u and momentum p = mu is 

d 

T — ((E+uxB) (5.29) 
dt 
We want to write this equation in 4-vector notation in a way that makes it clear how 
all the objects change under Lorentz transformations. 


By now it should be intuitively clear how this is going to work. A moving particle 
experiences the magnetic force. But if we boost to its rest frame, there is no magnetic 
force. Instead, the magnetic field transforms into an electric field and we find the same 
force, now interpreted as an electric force. 


The relativistic version of (5.29) involves the 4-momentum P”, defined in (5.6), the 
proper time 7, reviewed in Section 5.1.2, and our new friend the electromagnetic tensor 
Fv’. The electromagnetic force acting on a point particle of charge q can then be 
written as 


ge, (5.30) 
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where the 4-velocity is 


dX” c 
H— n . 1 
U = () (5.31) 


and the 4-momentum is P = mU. Again, we see that the relativistic form of the 
equation (5.30) is somewhat prettier than the original equation (5.29). 


Unpacking the Lorentz Force Law 


Let’s check to see that the relativistic equation (5.30) is giving us the right physics. 
It is, of course, four equations: one for each u = 0,1,2,3. It’s simple to multiply 
out the right-hand side, remembering that U,, comes with an extra minus sign in the 
spatial components relative to (5.31). We find that the u = 1, 2,3 components of (5.30) 
arrange themselves into a familiar vector equation, 

OP asain aD) => P ge) (5.32) 

dt dt 
where we’ve used the relationship dt/dr = y. We find that we recover the Lorentz 
force law. Actually, there’s a slight difference from the usual Newtonian force law 
(5.29), although the difference is buried in our notation. In the Newtonian setting, the 
momentum is p = mu. However, in the relativistic setting above, the momentum is 
p = myu. Needless to say, the relativistic version is correct, although the difference 
only shows up at high speeds. 


The relativistic formulation of the Lorentz force (5.30) also contains an extra equation 
coming from u = 0. This reads 


=-y7E-u (5.33) 


Recall that the temporal component of the four-momentum is the energy P® = E/c. 
Here the energy is E = myc? which includes both the rest-mass of the particle and its 
kinetic energy. The extra equation in (5.30) is simply telling us that the kinetic energy 
increases when work is done by an electric field 


d(Energy) 


ae 
dt ee 


where I’ve written energy as a word rather than as E to avoid confusing it with the 
electric field E. 
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5.4.2 Motion in Constant Fields 


We already know how electric and magnetic fields act on particles in a Newtonian world. 
Electric fields accelerate particles in straight lines; magnetic fields make particles go 
in circles. Here we’re going to redo this analysis in the relativistic framework. The 
Lorentz force law remains the same. The only difference is that momentum is now 
p = myu. We’ll see how this changes things. 


Constant Electric Field 


Consider a vanishing magnetic field and constant electric field E = (E,0,0). (Note 
that E here denotes electric field, not energy!). The equation of motion (5.32) for a 
charged particle with velocity u = (u,0,0) is 


d(yu) 
dt 


m =qE => myu=qEt 


where we’ve implicitly assumed that the particle starts from rest at t = 0. Rearranging, 
we get 


dx qEt 


Uz= = 
dt ym? + PE] 


Reassuringly, the speed never exceeds the speed of light. Instead, u —> c as t > oo as 
one would expect. It’s simple to integrate this once more. If the particle starts from 


a mc ly i q? bt? 1 
-— qE mee 


For early times, when the speeds are not too high, this reduces to 


the origin, we have 


1 
max ~ qE? +... 


which is the usual non-relativistic result for particles undergoing constant acceleration 
in a straight line. 
Constant Magnetic Field 


Now let’s turn the electric field off and look at the case of constant magnetic field 
B = (0,0, B). In the non-relativistic world, we know that particles turn circles with 
frequency w = qB/m. Let’s see how relativity changes things. 


=i 


We start by looking at the zeroth component of the force equation (5.33) which, in 
the absence of an electric field, reads 


we 
dt 

This tells us that magnetic fields do no work. We knew this from our course on 

Newtonian physics, but it remains true in the relativistic context. So we know that 

energy, E = myc’, is constant. But this tells us that the speed (i.e. the magnitude of 

the velocity) remains constant. In other words, the velocity, and hence the position, 

once again turn circles. The equation of motion is now 


d(yu) 
"Tt 


Since y is constant, the equation takes the same form as in the non-relativistic case 


=quxB 


and the solutions are circles (or helices if the particle also moves in the z-direction). 
The only difference is that the frequency with which the particle moves in a circle now 
depends on how fast the particle is moving, 


qB 
a 
my 


If you wanted, you could interpret this as due to the relativistic increase in the mass 
of a moving particle. Naturally, for small speeds y ~ 1 and we reproduce the more 
familiar cyclotron frequency w = qB/m. 


So far we have looked at situations in which E = 0 and in which B = 0. But we’ve 
seen that E-B = 0 and E?—B? are both Lorentz invariant quantities. This means that 
the solutions we’ve described above can be boosted to apply to any situation where 
E- B = 0 and E? — B? is either > 0 or < 0. In the general situation, both electric and 
magnetic fields are turned on so E - B Æ 0 and we have three possibilities to consider 
depending on whether E? — B? is > 0 or < 0 or = 0. 


5.5 ...and Action 


The principle of least action provides an elegant and powerful way to think about the 
classical mechanics of particles. In this section we will see that the action principle can 
also be used to describe classical fields. 


5.5.1 Non-Relativistic Particles 


The principle of least action was described in some detail in the lectures on Classical 
Dynamics. For a particle moving along a trajectory x(t), subject to the potential V(x), 
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the action is given by 


S{x(t)] = [ dt 5m? = v| (5.34) 


We fix the position of the particle at time tı and tg. The principle of least action says 
that when the particle moves between these two points, it takes a path that extremises 
the value of the action. 


It is simple to show that the principle of least action is equivalent to Newtonian 
equation of motion. We vary the path, x(t) > x(t) + ôx(t), subject to the requirement 
that 6x(t,) = ôx(t2) = 0 so that the end points are fixed. The change in the action is 
then 


t2 

se = f dt [mx - öx- VV - dx| 
ti 

t2 


= f'a |- mš- vv] TRE [mx - bx] 


ty tı 


where the second line follows after integration by parts. The boundary term vanishes 
because the end points are fixed. The path x(t) extremises the action if 6S = 0 for all 
variations ôx(t). This holds only if if the 


mx = -VV 
which we recognise as the Newtonian equation of motion. 


For this course, we’re interested in writing down the action for a particle of charge q 
interacting with electric and magnetic fields. This is written in terms of the potential 
(x) and the vector potential A(x). It is 


S{x(t)] = | dt Son? — goes) + a+ AG (5.35) 


We will now show that this reproduces the Lorentz force law. The electric term involv- 
ing ¢ is just of the usual potential energy type and the fact it gives the right equation 
of motion follows immediately from the definition of the electric field E = —V@¢. Mean- 
while, we have a short calculation to do for the the magnetic force. It is a calculation 
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that is best done in index notation 


5 f e [z A'(x)] 


tı tı 


t2 OA’ OA. 
— -= t rt J 
[ dt | ÔT m +o] 


Ate 28s 


I 
a 
~ 


i A (x) + tA (x)| 


I 
a 


“on or 


tz i PT os 
D 28] ate 


t2 t2 

= / dt eiki óx BY = f dt (x x B) - 6x 
ty ty 

where, in the second line, we’ve integrated by parts and thrown away the boundary 

term and, in the third line, we’ve relabelled the indices in the second term. In the final 

line, we’ve used the definition of the magnetic field B = V x A. The net result is that 

varying the action (5.35) indeed reproduces the Lorentz force law 


mx = q(E +x x B) 


There’s something interesting about the action (5.35). The potentials ¢ and A have 
been our constant companions throughout these lectures but, until now, they’ve only 
played an auxiliary role. They were useful in helping us solve the Maxwell equations. 
But they weren’t necessary. At any stage, we could have worked just with E and B and 
not worried about the underlying potentials. That’s no longer true when we turn to 
the Lagrangian formulation. There’s no Lagrangian formulation of electromagnetism 
that involves only E and B. Instead, you’re obliged to use the potentials @ and A. 
This is true for both the point particle action (5.35) and for the action that we’ll meet 
shortly that leads to the Maxwell equations. 


Whenever some mathematical object is written in terms of ọ and A, some minor 
alarm bells should start to ring. This is because the are not unique functions, but are 
defined only up to gauge transformations (5.13). 


og & and A->A+Vyx 


with y = x(x,t). Anything physical should not depend on the choice of x. This is 
true for the electric and magnetic fields E and B. Happily, it is also true for the action 
(5.35). Under a gauge transformation, this shifts as 


o 2 d 
ERES =s+4 f d X 
tı 


t2 
s= 8+4 | dt Ot 7 


ty 
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We see that the change of the action is a total time derivative. But we know from 
the lectures on Classical Dynamics that adding a total derivative to the action doesn’t 
change the physics. 


5.5.2 Relativistic Particles 


Our first task is to write down an action for a relativistic particle. As we’ll see, there 
are two ways to do this; the first is simpler, but the second is better. 


A simple action for a relativistic particle is 


S[x(t)] = -më f at \/1— z (5.36) 


First note that if we Taylor expand the action, we get back the familiar Newtonian 
action (5.34) for a free particle. More importantly, the canonical momentum associated 
to the action is 


OS 


x 


where y = (1 — x?/c?)~'/? is the usual relativistic gamma factor. This then gives us 
the right equation of motion for a free relativistic particle, 


It’s straightforward to couple this particle to electric and magnetic fields: we just 
include the same terms that we saw in (5.35), 


ppt) | = fu (-mey 1— z — q + qx- a) (5.37) 


Although this action gives the right relativistic equations of motion, there’s something 
more than a little unsatisfactory about it. This is because it gives the equations of 
a motion is a very particular reference frame, with a very particular choice of time 
coordinate t. And that’s not really in the spirit of special relativity. Indeed, the 
essence of Minkowski space is that time and space sit on a very similar footing, with 
Lorentz transformations rotating the two. How can we write down an action that puts 
time and space on the same footing and manifestly exhibits invariance under Lorentz 
transformations? 
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The Covariant Action 


As we will now see, the construction of such an action needs a new ingredient. To start 
with, we’ll follow our nose. It’s clear that if we want an action with manifest Lorentz 
invariance then we should work with the four-vector X” = (ct,x). The worldline of a 
particle is then some parameterised curve 


X"(a) 


with o is a label that tells us where we sit on the curve. This four-vector will be our 
degree of freedom in constructing an action. 


It’s worth pausing to stress just how different our current situation is from the original 
form of the relativistic action (5.36). For (5.36), we constructed an action based on the 
path x(t), where x is the degree of freedom and the time t is used to parameterise the 
curve. But now we’re going to construct an action based on X“(c), which means that 
we've promoted time to a dynamical degree of freedom, sitting alongside x. That’s 
going to need some explaining. After all, the number of degrees of freedom is one of 
the crudest ways we have to describe a system and usually if we add an extra degree 
of freedom, we’re going to be describing something rather different. But here we’re not 
aiming at describing the same physical system as (5.36) — a relativistic particle — just 
with the symmetries manifest. 


Relatedly, we have now introduced a different parameter ø that describes where we 
sit on the worldline X#(o). What choice of o should we take? For now we'll just let o 
be any parameterisation that we like. We’ll soon see that, in fact, this is more or less 
the right answer! 


If we’re working with X“(c) as our degree of freedom, it’s straightforward to con- 
struct an action that exhibits Lorentz invariance. The one that works turns out to 


be 
va dX! dX” 
H =e e -A 
S|X*(a)| me f do Af Nuw e (5.38) 


The coefficients in front ensure that the action has dimensions [S] = Energy x Time as it 
should. We see immediately that this action is invariant under Lorentz transformations 
X! — A#,X” that we saw earlier in (5.8). This follows just because the integrand is 
a tensor with the u,v indices contracted correctly. For this reason, (5.38) is known as 
the covariant action. 
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The action S is actually closely related to something familiar from the world of special 
relativity: it is proportional to the proper time experienced by the particle. Recall that 
a particle moving along a worldline X“(o), experience a proper time 


1 f° dX! dX” 
— ' seal 5.39 
me) A do fm do’ do' ( ) 


In special relativity, the proper time is maximised by a particle that does not accelerate. 
This fact is famous from the twin paradox where the dull stay-at-home twin ages fastest. 
Here it sits nicely with the fact that the proper time is identified with the action and 
hence is extremised on solutions to the equations of motion. 


In addition to Lorentz invariance, the action (5.38) has a second symmetry of a very 
different kind, and this is the key to understanding the issues that we raised above. This 
second symmetry is reparameterisation invariance. Suppose that we pick a different 
parameterisation of the path, o, related to the first parameterisation by a monotonic 
function G(o). Then we could equally as well construct an action S using this new 
parameter, given by 


7 OO aes dX" dX” 
S = -me f da nw- d 


We might worry that this different parameterisation will give different equations of 
motion. Happily this is not the case because the two actions are, in fact, identical 


r %2 dé dX" dX” (da\? 
=_lClc d — v z — 
2 me | ° To j» do do (z) j 


t 


We see that the action takes the same form regardless of our choice of parameterisa- 
tion. Although we’ve called this a “symmetry”, it’s not a symmetry in the same sense 
as Lorentz transformations. In particular, reparameterisation does not generate new 
solutions from old ones. Instead, it is a redundancy in the way we describe the system. 
It is similar to the gauge “symmetry” of Maxwell and Yang-Mills theory which, despite 
the name, is also a redundancy rather than a symmetry. 


Reparameterisation invariance has a number of consequences. The first is that it 
explains why the action (5.38) has only three degrees of freedom, even though it is a 
function of four variables X“(o). This is because one of the degrees of freedom X“ is 
not physical. Suppose that you solve the equation of motion to find a trajectory X” (øo). 
In most dynamical systems, each of these four functions would tell you something about 
the physical trajectory. But, for us, reparameterisation invariance means that there is 
no actual information in the value of a. To find the physical path, we should eliminate 
c to find the relationship between the X”. And this kills one degree of freedom. 
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We can see this most clearly by making a cunning choice for the parameter o that 
parameterises the worldline. Suppose that we choose ø to coincide with the time t for 
some intertial observer: o = t. Then dX°/do = and the action (5.38) then becomes 


2 
S = -mce f ayi- 5 


where here x = dx/dt. But this is the action (5.36) that we started this section 
with. So our two actions (5.38) and (5.36) are indeed equivalent, but each has different 
advantages. The action (5.36) makes it clear that we are dealing with a system with 
three degrees of freedom x, but Lorentz invariance is hidden. Meanwhile the action 
(5.38) has manifest Lorentz invariance, but at the cost of introducing more degrees of 
freedom than are physical. But, as we’ve seen above, the reparameterisation invariance 
of the action allows us to remove the time degree of freedom and return to (5.36). 


There’s yet another manifestation of reparameterisation invariance. To see this, we 
compute the canonical momentum associated to X”, 


OL 1 
LZ po, T OMe- 
OX X’X, 
where here X” = O0X"/Ao. You can check that P” above coincides with the four- 
momentum P” = mdX"/dr that we defined previously in (5.6). (This follows from the 
fact that the proper time 7 defined by (5.39) dr/do = L/mc? with L the Lagrangian.) 


It’s a familiar result from special relativity that these momenta are not all independent, 
but obey 


Pee =e (5.40) 


While this result is familiar in special relativity, it’s rather surprising from the per- 
spective of Lagrangian mechanics. This novel feature can be traced to the existence 
of reparameterisation invariance, meaning that there was a redundancy in our original 
description. Indeed, whenever theories have such a redundancy there will be some con- 
straint analogous to (5.40). As one final comment, note that if we expand out (5.40), 
we have 


(P? =p + mc? 


In particular, we see that we must have P° Æ 0. This is important. There’s nothing 
that tells us that we must have p # 0. The particle is quite able to just sit still in 
space if it wants. But P? Æ 0 tells us that the particle is obliged to move in the time 
direction. Physically, this again reflects the fact that the action (5.38) has only three 
degrees of freedom, not four. Physiologically, this is why you get old. 
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Finally, we can couple the covariant action (5.38) to electromagnetism. We do this 
by introducing the gauge field four-vector A” = (¢/c, A) and extend the action (5.38) 


to 
= dX! dX” dX" 
H — — tC — oy 
S|X*(a)| [ do | me! Ny a gA,,(X) a (5.41) 


If we again pick the worldline parameter o to coincide with the time of some inertial 
observer, g = t, then we again find that this action coincides with our earlier result 
(5:37 i 


5.5.3 The Maxwell Action 


Our next goal is to write down an action principle for the Maxwell equations. Again 
we need a change of perspective which, this time, is just the usual shift from thinking 
about particles to thinking about fields. The action associates a number S to every 
field configuration E(x,t) and B(x,t). We will show that the action that reproduces 
the Maxwell equation takes the beautifully compact form 


S[A,(x,t)] =-—— | d'r FF (5.42) 


Before we compute the equations of motion, here are a number of comments. 


e The action is Lorentz invariant. This is true both of the integrand Fu F”” and 
the measure dx = cdtd?x. Under a Lorentz transformation (5.8), the measure 
picks up a Jacobian factor det A = 1. 


e For a non-relativistic particle, the action takes the form of “kinetic energy minus 
potential energy”. But there is a similar interpretation of the Maxwell action 
(5.42). Expanding out the integrand using (5.23), we have 


= 3 0. | n2 
S= | ata's ($x zB?) 


Comparing to the energy stored in electric and magnetic fields that we derived 
in (4.3), we see that E? is like the kinetic energy, while B? is like the potential 
energy. 


e As we can see, the action depends on the electric field E(x, t) and magnetic field 
B(x,t). Nonetheless, the action should be viewed as a functional of the underlying 
gauge field A,,(x,t), albeit one that is invariant under gauge transformations 
A, — A, — O,x. This mirrors what we saw for the action for the Lorentz force 
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law (5.35) where we were also obliged to introduce the scalar and vector potentials. 
The need to view the Maxwell action (5.42) as functional of the gauge potential 
Ap is reflected in the fact that we should vary with respect to A, rather than E 
or B, when deriving the equations of motion. This is what we do next. 


We vary the action by considering a neighbouring configuration A, +6A,. Using the 
definition of the electromagnetic tensor is Fy = 0,A,—0,A,, the change in the action 
is 


5S = -> dtr 2(0,5A, — 0,6A,) FH 


Ho 
er d*x F#0,5A, 
Ho 
1 
= — | d'z (3, F#”)5A, 
Ho 


where, as usual, we have discarded the total derivative term after integrating by parts. 
We see that the principle of least action, ôS = 0, gives the vacuum Maxwell equations 


3LF” =0 


Note that we only get half the Maxwell equations from the variation of the action. The 
other half, 0, F“” = 0 follow immediately from working with the gauge potential A,,. 


The action (5.42) gives the vacuum Maxwell equations. If we have some fixed current 
J", we can modify the action to read 


1 i 
S[A,] = F E (-+” Fiy = Ay) (5.43) 
Repeating the steps above, we now get the Maxwell equation (5.28), 
One = uoJ” 


The current J” in (5.43) couples directly to the gauge potential A„. This introduces a 
level of jeopardy, because the action should be invariant under gauge transformations 
A, — A, +0,x. Under such a gauge transformation, the action shifts as 


1 1 
S — S+ fas (3 xX) J" = S = r jes xla J”) 


We see that the action is invariant only if the current is conserved, meaning „J“ = 0. 
But this, of course, is the expected property of the electric current. We see that 
the action principle introduces a nice interplay between gauge invariance and current 
conservation. 
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We can combine our Maxwell action (5.42) with the action for a relativistic point 
particle (5.41). We then have 


1 dX! dX dX” 


Comparing the last term to that in (5.43), we see that the current from a relativistic 


particle takes the form 


dX” 
u a =X 
J ac | do ae O° (x (c)) 


The Theta Term 
As we saw previously, there is one other Lorentz invariant term that we can construct 
from the electric and magnetic fields. This is (5.25) 


ites 1 
=F" Fp = =— Ee B 
4 (E 


We might wonder what would happen if we were to add this term to the Maxwell action 
(5.42). To answer this, we need to think about what the term F, F*” looks like when 
written in terms of the gauge potential A,. We have 


~ 1 
PVF y = E Ppt w SO Ag Puy =o 0 Aa lap) 


where the last equality holds because the derivatives in F, are anti-symmetrised with 
p. The upshot is that this term is a total derivative and total derivatives don’t affect 
the equations of motion. So adding such a term doesn’t do anything. 


In fact, that last statement is only partially true. Adding total derivatives to the 
action doesn’t change the classical equation of motion. But it can change the quantum 
theory in subtle and interesting ways. That’s also true here, where the term FY” F uv ÍS 
known as the theta term. (Named, unhelpfully, after the coefficient that sits in front of 
it which is usually called 6.) The theta term has an interesting role to play in, among 
other places, the story of topological insulators. You can read more about this in the 
lectures on Gauge Theory. 


5.6 More on Energy and Momentum 


The electric and magnetic fields carry both energy and momentum. The purpose of 
this section is to further explore their properties. 


== 


5.6.1 Energy and Momentum Conservation 


The energy density stored in the electric and magnetic fields is (4.3), 


e = pp 4 _* (5.44) 

2 2Ho 
The importance of energy lies in the fact that it’s conserved. Because we’re dealing 
with an energy density, it must be conserved locally which means that there must be 
an underlying continuity equation. This is the essence of Poynting’s theorem that we 
derived in Section 4.4. This follows by taking the time derivative and using the Maxwell 


equations 
oE OE 1 oB 
— = oE. -—B. 
a Bt m Ot 
= See ee PNE 
Lo Ho 


which we can write as 


£ +V-8=-E-J with S=ExB (5.45) 
Here S is the Poynting vector that we introduced previously in Section 4.4. It has the 
interpretation of the energy current. In the absence of any external electric current, so 
J = 0, (5.45) tells us that energy in the electromagnetic field is conserved. However, 
if there are electric currents J # 0 around, then the electric field does work on them, 
extracting energy from the field. That’s the meaning of the right-hand side of (5.45). 


The derivation above shows that the Poynting vector S can be viewed as the flow of 
energy carried by the electromagnetic field. But it also has a second, closely related 
interpretation: it is the momentum in the electromagnetic field. More precisely, the 
electromagnetic momentum density is 


1 
P=—8=«E xB (5.46) 


Momentum is also conserved, and that means that there must be a second continuity 
equation involving the time derivative of P .And there is. We have 


oP JE OB 
T a(i B+E) 
= +(VxB)xB-JxB—6Ex (VE) 


Lo 
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We use the vector identity 
1 
(V x B) x B = (B-V)B- ZVB? 


with a similar expression for E. At this point, it’s helpful to revert to index notation. 
We have 


OP; 1 1 : 


1 1 1 
; |> | BiB; — +6; B? E,E; — -0y E? 
0 [a (P230) to (B = 7608!) 


1 
Ho 


The first term in square brackets is a total derivative. That’s just what we want for a 
continuity equation. Meanwhile, we replace the V - B and V - E terms on the final line 
by the appropriate Maxwell equation. The end result is three continuity equations, one 
for the momentum in each different direction 


OP; 
at + 0; 0% = —(pE -Jx B); (5.47) 
where oj; is the collection of terms in the previous square bracket 
Tere iE 
Oij = €0 zôyE — kik; ae zô; B = BB, (5.48) 
2 Ho 2 


This is known as the Maxwell stress tensor. Note that it is symmetric. We’ll come back 
to this shortly. We also met a stress-tensor g;; in our lectures on Fluid Mechanics: they 
are conceptually the same object. 


In the absence of any charges or currents, the right-hand side of (5.47) vanishes and 
we learn that the vector P is conserved. But we recognise the right-hand of (5.47) as 
the force density on charges and currents. If the currents are mobile electrons, then 
this force will increase their momentum and so we expect a corresponding decrease of 
the momentum in the electromagnetic field. That’s indeed what we see. 


As we’ve seen, the momentum density P and the energy flux S are proportional: 
P =S/c’. There are two ways to see why the factor of c? is needed. The first is that 
it ensures that the right-hand side of (5.47) is the force experienced by charges and 
currents, so that (5.47) can be viewed as a field theoretic generalisation of “F = ma”. 
The second is to invoke some quantum mechanical intuition, where the energy and 
momentum of photons are related by p = E/c. That accounts for one factor of c. The 
other arises because the energy flux is Ec, so the momentum is p = (Ec)/c’. 
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5.6.2 The Energy-Momentum Tensor 
There is an interesting interplay between field theory and relativity. This is illustrated 


by the fact that the energy density can actually be viewed as the zeroth component of 
two different four vectors! 


The first of these four vectors follows because we’re dealing with a field theory. This 
means that energy (or, more precisely, energy density) is conserved locally and sits in 
a current J” = (E, S/c), which obeys 0,J" = 0 as seen in (5.45). (Recall that ô = +0; 
which is why there’s that extra factor of c in the energy flux.) 


But, in relativistic particle mechanics, the energy sits in a four vector with momentum 
(5.6). This suggests that we can also form the four vector (E/c,P). What’s going on? 


In fact, the energy density naturally sits not in a vector, but in a rank 2 tensor. This 
is known as the stress-energy tensor, or sometimes as the energy-momentum tensor, 
and sometimes, rather lazily and confusingly, just as the stress-tensor. It takes the 


puw — E cP; 
Si/c Oij 


This includes both four-vectors above, one as a row vector and the other as a column 


form 


vector. Because of the relation (5.46) between the energy flux S and the momentum 
density P, the energy-momentum tensor is actually symmetric 


TH — T™e 
We’ll return to this observation shortly. 


Above, we argued that T,» should be a tensor on the grounds that the energy density 
E can be viewed as the zeroth component of two different four-vectors. Putting in the 
various definitions for the energy density € (5.44), the Poynting vector (5.45), the 
momentum P (5.46), and the stress tensor oj; (5.48), you can check that the energy- 
momentum tensor can be constructed from the electromagnetic tensor F” defined in 


(5.15). We have 


1 1 
pw oF (rer, = TIF” Fn (5.49) 
For example, the T°? component is 
00 1 Op p0 1 po 1 2/2 1 2 Jaa 2 
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This shows that 7” is indeed a tensor as advertised, meaning that it has the appropriate 
transformation law. Under a Lorentz transformation A, we have 


ee NP eee (5.50) 


The column vectors are constructed out of conserved currents for energy and momentum 
respectively. This means that, in vacuum, the energy-momentum tensor obeys 


grh =0 


for each v = 0,1,2,3. This captures the conservation of energy and momentum. Of 
course, because TH” = T”", we also have 0,T"” = 0. 


If we turn on background electric charges p and electric currents J then, as we have 
seen, T“” is not conserved as the electromagnetic fields do work. From (5.49), we have 


1 
Hoby TH, = OF) Fpp = F". 0, Fpp + gE OLE po 


1 
— Ho? F", — 3E” (OpFua — oF up — OF pa) 


To get to the last line, we’ve used Maxwell’s equations in the form 0,.F"" = uoJ” and 
engaged in some relabelling of dummy indices. (The F°, Fpp term in the first line is 
split into two, with the dummy indices relabelled differently in the two cases.) But the 
final term in brackets vanishes, a fact that is equivalent to the other set of Maxwell 
equations ð, F H = () as we previously noted in (5.27). The upshot is that, in the 
presence of charges and currents, we have 


ð T” = -F J, 
This combines our previous equations (5.45) and (5.47) into tensor form. 


All relativistic field theories have an energy-momentum tensor. This plays a special 
role in a number of contexts, not least in General Relativity where T’” sits on the 
right-hand side of the Einstein equations and sources the gravitational field, in much 
the same way as J” sources the electromagnetic field in these lectures. 


The energy-momentum tensor (5.49) has one further property that is special to 
Maxwell theory: it is traceless 


I, =0 


This follows from (5.49) because n” n, = 4. Although we won’t show it here, it turns 
out that the energy-momentum tensor is traceless because of a special symmetry of 
Maxwell theory known as conformal symmetry. 
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If we have a gas of photons that are homogeneous, then the energy-momentum tensor 
necessarily takes the form 


T™ = diag(E, P, P, P) 


Here the diagonal entries P of the stress-tensor have the interpretation of pressure. 
The tracelessness of the energy-momentum tensor tells us that the energy density and 
pressure are related by € = P/3. This fact plays an important role in Cosmology. 


5.6.3 Angular Momentum 


In classical mechanics, there are three important conserved quantities: energy, momen- 
tum and angular momentum. But for our electromagnetic fields, we have described 
only the first two. We now rectify this. 


In fact, we’ll see that it will be fruitful to think more generally about any field theory 
that has a conserved energy density € and conserved momentum density P, such that 
the following two continuity equations hold: 


oE OP; 

ae a and p 1 21% = 0 
We know that in Maxwell theory the energy flux S is proportional to the momentum 
P. This, ultimately, was responsible for the symmetry T” = T””. In what follows, we 
won't assume any relation between S and P. Instead, we will see that this is a require- 


ment of the conservation of angular momentum, together with Lorentz symmetry. 


Following our nose from classical mechanics, we expect that the angular momentum 
density of the field is 


Lix) =x x P(x) 


We can ask if this is conserved. Differentiating by time only acts on P, not on the 
vector x above which simply tells us the point in space that we’re looking at. We then 
have 


OL; OP, 
a at 
We see that we get a continuity equation for angular momentum 
OL; 
Ot 


only if the stress tensor is symmetric: oj; = Oji- 


= — Eijk {OTK = —CijkO! (L{OM) + EijkOkj 


+ o; (eqn tren) =0 (5.51) 
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The stress tensor g;; also plays a role in Fluid Mechanics. In that context, we gave 
a slightly awkward argument that o;; should be symmetric by showing that something 
bad would happen if it wasn’t. That something bad was that a finite torque would give 
rise an infinite angular velocity. That’s closely related to the much simpler derivation 
above that shows we have conservation of angular momentum if and only if oj; is 
symmetric. 


The discussion above holds for any field theory with rotational invariance. How- 
ever, if we have a Lorentz invariant theory like electromagnetism then it tells us that 
the energy-momentum tensor must also be symmetric. This follows from the Lorentz 
transformation law (5.50). If T is symmetric in one frame then it is symmetric in all 
frames if and only T® = T°. This relates the energy flux and momentum as in (5.46). 


Just as the energy density and momentum density sit nicely in a Lorentz invariant 
tensor 7”, so too does the angular momentum density. However, this time it’s a 
3-tensor, 


SHPO — PTI — pT HP 


By construction, S°% = —S#??, This tensor is conserved provided the T” = T”, a 
fact that follows from the same kind of calculation we did for the angular momentum, 
but now in Lorentz covariant form 


IpI =T + «PO, TH — T°? — 27,7" = 0 


where we’ve used both the symmetry of TJ” and the fact that it’s conserved, so 0,7” = 
0. 


The components of S“?? include the angular momentum, which can be found lurking 
in S% = ce" LF. The equation 0,5”? = 0 is then just conservation of angular mo- 
mentum of the field that we saw previously in (5.51). But that means there are also 
three more conserved quantities in this tensor, namely 0,,S“" = 0 for i = 1,2,3. What 
are these?! It’s simple to find the answer by expanding out 


S% — (rE — S'4) 


In fact, this has a rather straightforward meaning when S* = 0: it is just the “centre of 
mass”, or more precisely the centre of energy, of the field configuration. When S* Æ 0, 
there is an additional drift term. The fact that this is conserved is rather like a field- 
theoretic version of Newton’s first law which says that, in the absence of any force, 
a particle will continue at a constant speed. We see that after all these relativistic 
gymnastics, we come to something familiar, albeit in unfamiliar language. 
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6. Electromagnetic Radiation 


We've seen that Maxwell’s equations allow for wave solutions. This is light. Or, more 
generally, electromagnetic radiation. But how do you generate these waves from a 
collection of electric charges? In other words, how do you make light? 


We know that a stationary electric charge produce a stationary electric field. If we 
boost this charge so it moves at a constant speed, it produces a stationary magnetic 
field. In this section, we will see that propagating electromagnetic waves are created 
by accelerating charges. 


6.1 Retarded Potentials 


We start by simply solving the Maxwell equations for a given current distribution 
J" = (pc, J). We did this in Section 2 and Section 3 for situations where both charges 
and currents are independent of time. Here we’re going to solve the Maxwell equations 
in full generality where the charges and currents are time dependent. 


We know that we can solve half of Maxwell’s equations by introducing the gauge 
potential A, = (¢/c,—A) and writing Fay = 0,A, — O,A,. Then the remaining 
equations become 


OF = uoJ” => DA" — 8"(8, A”) = mJ” (6.1) 


where [O is the wave operator: O = 0,0" = (1/c?)6?/0t? — V?. 
This equation is invariant under gauge transformations 
At — A" + ohy (6.2) 


Any two gauge potentials related by the transformation (6.2) are considered physically 
equivalent. We will use this symmetry to help us solve (6.1). To do this we make a 
gauge choice: 


Claim: We can use the gauge symmetry (6.2) to choose A“ to satisfy 
0, A" = 0 (6.3) 


This is known as Lorentz Gauge. It was actually discovered by a guy named Lorenz who 
had the misfortune to discover a gauge choice that is Lorentz invariant: all observers 
will agree on the gauge condition (6.3). 
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Proof: Suppose you are handed a gauge potential A,, which doesn’t obey (6.3) but, 
instead, satisfies 0, A" = f for some function f. Then do a gauge transformation of 
the form (6.2). Your new gauge potential will obey 0, A” + Ox = f. This means that 


if you can find a gauge transformation y which satisfies Dy = f then your new gauge 
potential will be in Lorentz gauge. Such a y can always be found. This follows from 
general facts about differential equations. (Note that this proof is essentially the same 


as we used in Section 3.2.2 when proving that we could always choose Coulomb gauge 
V:-A=0). 


If we are in Lorentz gauge then the Maxwell equations (6.1) become particularly 
simple; they reduce to the sourced wave equation 


At = ne VY) A” = mJ" 6.4 
~\ 252 = bo (6.4) 


Our goal is to solve this equation, subject to the condition (6.3). We’ll assume that J 
has compact spatial support, meaning that the charges and currents are restricted to 
some finite region of space. As an aside, notice that this is the same kind of equation 
as Ly = —f which we needed to solve to go Lorentz gauge in the first place. This 
means that the methods we develop below will allow us to figure out both how to go 


to Lorentz gauge, and also how to solve for A, once we’re there. 


In the following, we’ll solve (6.4) in two (marginally) different ways. The first way is 
quicker; the second way gives us a deeper understanding of what’s going on. 


6.1.1 Green’s Function for the Helmholtz Equation 


For our first method, we will Fourier transform A, and J, in time, but not in space. 


We write 
ToS dw ~ ay saa dw ~ —iw 
A(x, t) =. g nx w) € t and Jed) af. 57 Ju(xw) e t 
Now the Fourier components ÄÃ,„(Xx, w) obey the equation 
ow \ 3 7 
(v i =) EEE (6.5) 


This is the Helmholtz equation with source given by the current J. 
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When w = 0, the Helmholtz equation reduces to the Poisson equation that we needed 
in our discussion of electrostatics. We solved the Poisson equation using the method 
of Green’s functions when discussing electrostatics in Section 2.2.3. Here we’ll do the 
same for the Helmholtz equation. The Green’s function for the Helmholtz equation 
obeys 


2 
W . 
(v + =) G(x;x’) = (x — x’) 
Translational and rotational invariance ensure that the solutions to this equation are of 


the form G.,(x;x’) = G(r) with r = |x — x’|. We can then write this as the ordinary 
differential equation, 


ld T 


w? 3 
r?dr dr ) 1 c2 GaSe) (6.6) 


We want solutions that vanish as r — oo. However, even with this restriction, there 
are still two such solutions. Away from the origin, they take the form 


tiwr/c 


Gy ~ 

m 

We will see shortly that there is a nice physical interpretation of these two Green’s 

functions. First, let’s figure out the coefficient that sits in front of the Green’s function. 

This is determined by the delta-function. We integrate both sides of (6.6) over a ball 
of radius R. We get 


R 2 
l d dG Ww 
4 drr? 2__* G| =1 
rf = E (+ Saa | 
Now, taking the limit R — 0, only the first term on the left-hand side survives. More- 


over, only the first term of dG,,/dr ~ (—1/r? + iw/cr)e+"/* survives. We find that 
the two Green’s functions for the Helmholtz equation are 


1 etiwr/c 


år r 


Note that this agrees with the Green’s function for the Poisson equation when w = 0. 


Retarded Potentials 


So which + sign should we take? The answer depends on what we want to do with the 
Green’s function. For our purposes, we'll nearly always need Gu ~ e+"/*/r. Let’s see 
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why. The Green’s function G, allows us to write the Fourier components A, in (6.5) 
as 


+iw|x—x'|/c 
A _ Ho 3 7 € 7 1 
A(x, w) = æ fa T Ta A 
which, in turn, means that the time-dependent gauge potential becomes 
—iw(t—|x—x’|/c) 


Ho dw 3.7 € 7 ! 
A, (x,t) = — | — | Ër —————_ J 
ust) a ý |x — x’| ule) 


But now the integral over w is just the inverse Fourier transform. With one difference: 
what was the time variable t has become the retarted time, tret, with 


Ctret = ct — |x — x'| 


We have our final result, 


/ 
A, (x,t) = = f da! Tbe) (6.7) 


[x= x| 


This is called the retarded potential. To determine the contribution at point x and time 
t, we integrate the current over all of space, weighted with the Green’s function factor 
1/|x — x'| which captures the fact that points further away contribute more weakly. 


After all this work, we’ve arrived at something rather nice. The general form of the 
answer is very similar to the result for electrostatic potential and magnetostatic vector 
potential that we derived in Sections 2 and 3. Recall that when the charge density and 
current were independent of time, we found 


(x) ex mE] and A(x) = Bo fate I(x!) 


~ Amey |x — x’| Ar |x — x’| 


But when the charge density and current do depend on time, we see from (6.7) that 
something new happens: the gauge field at point x and time t depends on the current 
configuration at point x’ and the earlier time tre = t — |x —x’|/c. This, of course, is due 
to causality. The difference t — tret is just the time it took the signal to propagate from 
x’ to x, travelling at the speed of light. Of course, we know that Maxwell’s equations 
are consistent with relativity so something like this had to happen; we couldn’t have 
signals travelling instantaneously. Nonetheless, it’s pleasing to see how this drops out 
of our Green’s functionology. 


= at = 


Finally, we can see what would happen were we to choose the other Green’s function, 
Gu ~ e"/¢/r, Following through the steps above, we see that the retarded time tret 
is replaced by the advanced time taav = t + |x — x’|/c. Such a solution would mean 
that the gauge field depends on what the current is doing in the future, rather than in 
the past. These solutions are typically thrown out as being unphysical. We’ll have (a 
little) more to say about them at the end of the next section. 


6.1.2 Green’s Function for the Wave Equation 


The expression for the retarded potential (6.7) is important. In this section, we provide 
a slightly different derivation. This will give us more insight into the origin of the 
retarded and advanced solutions. Moreover, the techniques below will also be useful in 
later courses’. 


We started our previous derivation by Fourier transforming only the time coordinate, 
to change the wave equation into the Helmholtz equation. Here we'll treat time and 
space on more equal footing and solve the wave equation directly. We again make use 
of Green’s functions. The Green’s function for the wave equation obeys 


(v — Z) G(x, t; x’, t) = P(x — x’)ó(t — t’) (6.8) 


Translational invariance in space and time means that the Green’s function takes the 
form G(x, t; x’, t) = G(x—x’,t—t’). To determine this function G(r, t), with r = x—x’, 
we Fourier transform both space and time coordinates, 


G(x, t) = | Sor kw) ae (6.9) 


Choosing x’ = 0 and t’ = 0, the wave equation (6.8) then becomes 


Equating the terms inside the integral, we see that the Fourier transform of the Green’s 
function takes the simple form 


2 1 
G(k,w) = “pate Pe 


4A very similar discussion can be found in the lecture notes on Quantum Field Theory. 
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But notice that this diverges when w? = c?k?. This pole results in an ambiguity in the 
Green’s function in real space which, from (6.9), is given by 


dw dk 1 je eee 
od E (27)* k? — w/e grae 


We need some way of dealing with that pole in the integral. To see what’s going on, 
it’s useful to change to polar coordinates for the momentum integrals over k. This will 
allow us to deal with that e“*? factor. The best way to do this is to think of fixing r 
and then choosing the k, axis to point along x. We then write k -r = kr cos 0, and the 
Green’s function becomes 


or 
G(r, t) = -a af dé sino f awk? fd =e 


Now the dọ integral is trivial, while the d0 integral is 


T ; 1 % d . 1 sin kr 
ing ikr cos 0 an ikr cos 0 TON —ikr _ „tikr L 
/ dé sind e one d8 E | Tir le e ] 2 Er 


After performing these angular integrals, the real space Green’s function becomes 


1 sin kr f7% ee 
G(r,t) = —~ dk ek? —— f d 
(r,t) 4r? [ kr Jo (w — ck)(w + ck) 


Now we have to face up to those poles. We’ll work by fixing k and doing the w integral 
first. (Afterwards, we’ll then have to do the k integral). It’s clear that we run into two 


poles at w = +ck when we do the w integral and we need a prescription for dealing 
with these. To do this, we need to pick a contour C in the complex w plane which 
runs along the real axis but skips around the poles. There are different choices for C. 
Each of them provides a Green’s function which obeys (6.8) but, as we will now see, 
these Green’s functions are different. What’s more, this difference has a nice physical 
interpretation. 


Retarded Green’s Function 


To proceed, let’s just pick a particular C and see what happens. We choose a contour 
which skips above the poles at w = +ck as shown in the diagram. This results in 
what’s called the retarded Greens function; we denote it as Grelr, t). As we now show, 
it depends crucially on whether t < 0 or t > 0. 
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Let’s first look at the case with t < 0. Here, Im) 
et — 0 when w — +ioo. This means that, for t < 0, P 
we can close the contour C in the upper-half plane as 
shown in the figure and the extra semi-circle doesn’t give 


rise to any further contribution. But there are no poles eò M 
- + e(@) 


in the upper-half plane. This means that, by the Cauchy 


residue theorem, Gyet(r,t) = 0 when t < 0. : 
Figure 50: 


In contrast, when t > 0 we have e™™* — 0 when 
w — —ioo, which means that we get to close the contour in the lower-half plane. Now 
we do pick up contributions to the integral from the two poles at w = +ck. This time 
the Cauchy residue theorem gives 


d e wt i e ickt etickt 
i: ” (w — ck)(w + ck) acs | 2ck  2ck | 


20 
= —— sin ckt t>0 
zg T C ( ) 
So, for t > 0, the Green’s function becomes 
1 1 f” 
Gret(r,t) = -a =f dk c sin kr sin ckt 


EENS ie ; ; ) ; 
= aa, | dk cc = e jer _ ge) 


4 


Each of these final integrals is a delta-function of the form 6(r + ct). But, obviously, 
r > 0 while this form of the Green’s function is only valid for t > 0. So the 6(r + ct) 
terms don’t contribute and we’re left with 


1 1 f pE (cibrtet) 4 er ik(rtet) _ pik(r—ct) _ p-ik(r-ct)) 


1 
Gres (t,t) = — =< 4(r — ct) t>0 


We can absorb the factor of c into the delta-function. (Recall that 6(a/a) = |a|d(x) for 
any constant a). So we finally get the answer for the retarded Green’s function 


Galet) 0 t<0 
re r,t = 1 
i —— 5 (trot) t>0 
Arr 

where tret is the retarded time that we met earlier, 

r 

tret = t — — 
Č 


The delta-function ensures that the Green’s function is only non-vanishing on the light- 
cone emanating from the origin. 
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Finally, with the retarded Green’s function in hand, we can construct what we really 
want: solutions to the wave equation (6.4). These solutions are given by 


A(x, t) = -mo | da'at GriealX, t; x’, t) Jax, t) (6.10) 
Ho 3I yal Ô (tret) A i 
a t 
3 d°x'd P uae 
= Ho f tgp Sula! strt) 
An |x — x’| 


Happily, we find the same expression for the retarded potential that we derived previ- 
ously in (6.7). 


Advanced Green’s Function 


Let us briefly look at other Green’s functions. We can BO 


pick the contour C in the complex w-plane to skip below m 3 =o 


the two poles on the real axis. This results in what’s 
called the advanced Green’s function. Now, when t > 
0, we complete the contour in the lower-half plane, as 


shown in the figure, where the lack of poles means that 
the advanced Green’s function vanishes. Meanwhile, for ; 
t < 0, we complete the contour in the upper-half plane Pipure pk 


and get 


where 
j 
taav = t + — 
c 
The resulting solution gives a solution known as the advanced potential, 


Ho 307 Oe ady) 
A t) = — | dr < 
„(x t) F j |x — x’| 
It’s hard to think of this solution as anything other than unphysical. Taken at face 
value, the effect of the current and charges now propagates backwards in time to de- 
termine the gauge potential A,,. The sensible thing is clearly to throw these solutions 
away. 
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However, it’s worth pointing out that the choice of the retarded propagator Gye 
rather than the advanced propagator Gaav is an extra ingredient that we should add to 
the theory of electromagnetism. The Maxwell equations themselves are time symmetric; 
the choice of which solutions are physical is not. 


There is some interesting history attached to this. A number of physicists have felt 
uncomfortable at imposing this time asymmetry only at the level of solutions, and 
attempted to rescue the advanced propagator in some way. The most well-known of 
these is the Feynman-Wheeler absorber theory, which uses a time symmetric propaga- 
tor, with the time asymmetry arising from boundary conditions. However, I think it’s 
fair to say that these ideas have not resulted in any deeper understanding of how time 
emerges in physics. 


Finally, there is yet another propagator that we can 


sss ; ee a eS) 
use. This comes from picking a contour C that skips x | A 
under the first pole and over the second. It is known as 
the Feynman propagator and plays an important role in Figure 52: 


quantum field theory. 


6.1.3 Checking Lorentz Gauge 


There is a loose end hanging over from our previous discussion. We have derived the 
general solution to the wave equation (6.4) for A,,.This is given by the retarded potential 


/ 
Aas mo fa Jul tret) (6.11) 


AT |x — x’| 


But the wave equation is only equivalent to the Maxwell equations if it obeys the 
Lorentz gauge fixing condition, 0,,A" = 0. We still need to check that this holds. In 
fact, this follows from the conservation of the current: ô„J” = 0. To show this, it’s 
actually simplest to return to a slightly earlier form of this expression (6.10) 


A, (x,t) = -mo f Pe'ar Gie (X, t; x’, t) Jux, t) 
The advantage of this is that both time and space remain on an equal footing. We have 
ð, A" (x,t) = -mo | Paat I Grax, t; xX’, t) JEX, t) 


But now we use the fact that Gret(X, t; x’, t’) depends on x — x’ and t — t to change the 
derivative ð, acting on x into a derivative a, acting on x’. We pick up a minus sign for 
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our troubles. We then integrate by parts to find, 
OAR) = +o | a'ar O Grelx, tix, t) J (x,t) 


= -mo f txat Gril t; x, t) OJ" xf) 
= 0 


as required. If you prefer, you can also run through the same basic steps with the form 
of the solution (6.11). You have to be a little careful because tre now also depends on 
x and x’ so you get extra terms at various stages when you differentiate. But it all 
drops out in the wash and you again find that Lorentz gauge is satisfied courtesy of 
current conservation. 


6.2 Dipole Radiation 


Let’s now use our retarded potential do understand something new. This is the set- 
up: there’s some localised region V in which there is a time-dependent distribution of 
charges and currents. But we’re a long way from this region. We want to know what 
the resulting electromagnetic field looks like. 


Our basic formula is the retarded potential, 


/ 
A,(x,t) = 42 f da! Tbe) (6.12) 
V Ix — x’| 

The current J“(x’,t) is non-zero only for x’ € V. We denote the size of the region V 
as d and we’re interested in what’s happening at a point x which is a distance r = |x| 
away. (A word of warning: in this section we’re using r = |x| which differs from our 
notation in Section 6.1 where we used r = |x — x’|). If |x —x’| > d for all x’ € V then 
we can approximate |x—x’| ~ |x| = r. In fact, we will keep the leading order correction 
to this which we get by Taylor expansion. (This is the same Taylor expansion that we 
needed when deriving the multipole expansion for electrostatics in Section 2.2.3). We 


have 
M / 
a ee 
> l _1 | a (6.13) 
jx—x’] ro r3 _ ` 


There is a new ingredient compared to the electrostatic case: we have a factor of |x—x'| 
that sits inside tre = t — |x — x’|/c as well, so that 


J (X, tet) = Ju(x’,t-—r/e+x-x'/ret+...) 
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Now we’d like to further expand out this argument. But, to do that, we need to know 
something about what the current is doing. We will assume that the motion of the 
charges and current are non-relativistic so that the current doesn’t change very much 
over the time T ~ d/c that it takes light to cross the region V. For example, if the 
muat) 


current varies with characteristic frequency w (so that J ~ e , then this requirement 


becomes d/c < 1/w. Then we can further Taylor expand the current to write 


x- x 


Jax, trt) = Jax, t — r/c) + J (x,t —r/c) 


bias 14 
rc - (9a) 


We start by looking at the leading order terms in both these Taylor expansions. 


6.2.1 Electric Dipole Radiation 
At leading order in d/r, the retarded potential becomes simply 


Arr 


A,,(x, t) © i [ev J, (x,t — r/c) 


This is known as the electric dipole approximation. (We’ll see why very shortly). We 
want to use this to compute the electric and magnetic fields far from the localised source. 
It turns out to be simplest to first compute the magnetic field using the 3-vector form 
of the above equation, 


A(x, t) & i [ee J(x’,t — r/c) 
We can manipulate the integral of the current using the conservation formula p+ V - 
J = 0. (The argument is basically a repeat of the kind of arguments we used in the 
magnetostatics section 3.3.2). We do this by first noting the identity 


Od aig) = (Oj) £i J, = pai J; 


We integrate this over all of space and discard the total derivative to find 


[ee Jix) = TE px)x =p 


where we recognise p as the electric dipole moment of the configuration. We learn that 
the vector potential is determined by the change of the electric dipole moment, 


A(x, t) & E pl —r/c) 


This, of course, is where the electric dipole approximation gets its name. 
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We now use this to compute the magnetic field B = V x A. There are two contri- 
butions: one when V acts on the 1/r term, and another when V acts on the r in the 
argument of p. These give, respectively, 

Ho 


Ho an ERS 
Tar? ** p(t — r/c) — Irc $ ” p(t — r/c) 


where we've used the fact that Vr = x. Which of these two terms is bigger? As we 
get further from the source, we would expect that the second, 1/r, term dominates 
over the first, 1/r? term. We can make this more precise. Suppose that the source is 
oscillating at some frequency w, so that p ~ wp. We expect that it will make waves 
at the characteristic wavelength A = c/w. Then, as long we’re at distances r >> A, the 
second term dominates and we have 


B(t,x) x -£ x p(t— r/o) (6.15) 
A4rrc 
The region r > is called the far-field zone or, sometimes, the radiation zone. We’ve 
now made two successive approximations, valid if we have a hierarchy of scales in our 
problem: r >> A> d. 


To get the corresponding electric field, it’s actually simpler to use the Maxwell equa- 
tion E = œ V x B. Again, if we care only about large distances, r >> A, the curl of B 
is dominated by V acting on the argument of p. We get 


a Lo wes 
VxBr Inra ** (x x p(t—r/c)) 
=> Ex Eg x (& x P(t- r/o) (6.16) 


Notice that the electric and magnetic field are related in the same way that we saw for 
plane waves, namely 


E=-cx xB 
although, now, this only holds when we’re suitably far from the source, r >> A. What’s 
happening here is that oscillating dipole is emitting spherical waves. At radius r > A 


these can be thought of as essentially planar. 


Notice, also, that the electric field is dropping off slowly as 1/r. This, of course, is 
even slower than the usual Coulomb force fall-off. 
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6.2.2 Power Radiated: Larmor Formula 


We can look at the power radiated away by the source. This is computed by the 
Poynting vector which we first met in Section 4.4. It is given by 


1 C x Ho r £0) x 

S = —E x B= —|B’*x = ——— |x x pl? x 
The fact that S lies in the direction X means that the power is emitted radially. The fact 
that it drops off as 1/r? follows from the conservation of energy. It means that the total 
energy flux, computed by integrating S over a large surface, is constant, independent 


of r. 
Although the radiation is radial, it is not uniform. Suppose that the dipole oscillates 
in the z direction. Then we have 


167?r2c 


|p)? sin? 6x (6.17) 


where @ is the angle between x and the z-axis. The emitted power is largest in the 
plane perpendicular to the dipole. A sketch of this is shown in the figure. 


A device which converts currents into electro- 
magnetic waves (typically in the radio spectrum) is 
called an antenna. We see that it’s not possible to cre- 
ate a dipole antenna which emits radiation uniformly. 
There’s actually some nice topology underlying this 
observation. Look at a sphere which surrounds the 
antenna at large distance. The radiation is emitted 


radially, which means that the magnetic field B lies 
tangent to the sphere. But there’s an intuitive result Figure 53: 

in topology called the hairy ball theorem which says 

that you can’t smoothly comb the hair on a sphere. Or, more precisely, there does not 
exist a nowhere vanishing vector field on a sphere. Instead, any vector field like B must 
vanish at two or more points. In this present context, that ensures that S too vanishes 
at two points. 


The total radiated power, P, is computed by integrating over a sphere, 


27 T 
p= Pr- S = ir f d | dO sin? 0 
S2 1672c 0 0 


where one of the factors of sin@ comes from the Jacobian. (In Section 5.6.2, we called 


the momentum density vector as P. This is not to be confused with the power here 
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which denoted by the same letter P.) The integral is easily performed, to get 
P = — |p (6.18) 


Finally, the dipole term p is still time dependent. It’s common practice to compute the 
time averaged power. The most common example is when the dipole oscillates with 
frequency w, so that |p|? ~ cos?(wt). (Recall that we’re only allowed to work with 
complex expressions when we have linear equations). Then, integrating over a period, 
T = 27/w, just gives an extra factor of 1/2. 


Let’s look at a simple example. Take a particle of charge Q, oscillating in the z 
direction with frequency w and amplitude d. Then we have p = pze’“’ with the dipole 


moment p = Qd. Similarly, p = —w?pze’. The end result for the time averaged power 
P is 
2,4 
5 _ Hop W 
P= 6.19 
127c ( ) 


This is the Larmor formula for the time-averaged power radiated by an oscillating 
charge. The formula is often described in terms of the acceleration, a = dw?. Then it 
reads 


Qa? 


p= 
127€9c3 


(6.20) 


where we’ve also swapped the pp in the numerator for eoc? in the denominator. 


6.2.3 An Application: Instability of Classical Matter 


The popular picture of an atom consists of a bunch of electrons 
orbiting a nucleus, like planets around a star. But this isn’t what 
an atom looks like. Let’s see why. 


We’ll consider a Hydrogen atom, with an electron orbiting around 
a proton, fixed at the origin. (The two really orbit each other 


around their common centre of mass, but the mass of the electron 


is Mme œ 9 x 1073! Kg, while the mass of the proton is about 1800 
Figure 54: This is 


bigger, so this is a good approximation). The equation of motion 
not what an atom 


for the electron is 


looks like. 
e f 
Mk = 
° ATE r? 
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The dipole moment of the atom is p = er so the equation of motion tells us p. Plugging 
this into (6.18), we can get an expression for the amount of energy emitted by the 


_ Ho e? 
~ 6m \ 4reomer? 


As the electron emits radiation, it loses energy and must, therefore, spiral towards the 


electron, 


nucleus. We know from classical mechanics that the energy of the orbit depends on its 
eccentricity. For simplicity, let’s assume that the orbit is circular with energy 


e 1 


4Teo 2r 


Then we can equate the change in energy with the emitted power to get 


2 
e Ho e? 
8m €or Gre \ Atenmer 


which gives us an equation that tells us how the radius of the orbit changes, 


Hoet 


E 2 
12r?ceom2r? 


Suppose that we start at some time, t = 0, with a classical orbit with radius ro. Then 
we can calculate how long it takes for the electron to spiral down to the origin at r = 0. 


It is 
m a= [ze _ An? a aye 
Hoe 


Now let’s plug in some small numbers. We can take the size of the atom to be ro ~ 


5 x 1071m. (This is roughly the Bohr radius that can be derived theoretically using 
quantum mechanics). Then we find that the lifetime of the hydrogen atom is 


T x 107 s 


That’s a little on the small size. The Universe is 14 billion years old and hydrogen 
atoms seem in no danger of decaying. 


Of course, what we’re learning here is something dramatic: the whole framework of 
classical physics breaks down when we look at the atomic scale and has to be replaced 
with quantum mechanics. And, although we talk about electron orbits in quantum 
mechanics, they are very different objects than the classical orbits drawn in the picture. 
In particular, an electron in the ground state of the hydrogen atom emits no radiation. 
(Electrons in higher states do emit radiation with some probability, ultimately decaying 
down to the ground state). 
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6.2.4 Magnetic Dipole and Electric Quadrupole Radiation 


The electric dipole approximation to radiation is sufficient for most applications. Ob- 
vious exceptions are when the dipole p vanishes or, for some reason, doesn’t change in 
time. For completeness, we describe here the leading order corrections to the electric 
dipole approximations. 


The Taylor expansion of the retarded potential was given in (6.13) and (6.14). 
Putting them together, we get 


' tre 
A, (x,t) = mf ate! Jul tres) 


T |x — x’| 
Ho 1 1 7 1 xox l xox i 
= E f da (t-A + At = r/o) E ) (: a ) Lo. 


The first term is the electric dipole approximation that we discussed in above. We will 
refer to this as AU Corrections to this arise as two Taylor series. Ultimately we will 
only be interested in the far-field region. At far enough distance, the terms in the first 
bracket will always dominate the terms in the second bracket, which are suppressed by 
1/r. We therefore have 


Arr2c 


Aj, (x, t) ~ AP (x,t) + al | da! (x -x')J,,(x',t — r/c) 
A 


As in the electric dipole case, it’s simplest if we focus on the vector potential 


~ AED Lo 3! al) Wy! t 
A(x,t)h x A ost) + gi fate (x- x) I(x’, t — r/c) (6.21) 


The integral involves the kind of expression that we met first when we discussed mag- 
netic dipoles in Section 3.3.2. We use the slightly odd expression, 


Because J in (6.21) is a function of x’, we apply this identity to the Jiz} terms in the 
expression. We drop the boundary term at infinity, remembering that we’re actually 
dealing with J rather than J, write the integral above as 


. Ta . . T 
fer pad = =f f ate (aj; — J; + PE) 
Then, using the appropriate vector product identity, we have 


. 1 . 1 ; 
pee (xs )J = 5X x pavixxss fate (x -x’)x’ p 
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Using this, we may write (6.21) as 
A(x, t) x APP (x,t) + AMG t) + AF? (x,t) 


where AMP is the magnetic dipole contribution and is given by 


AMP (x,t) = a XX pee x’ x J(x’,t — r/c) (6.22) 
nr?c 


and AF® is the electric quadrupole contribution and is given by 


AFS(x, t) = a pee (x - x’)x’ p(x’,t — r/c) (6.23) 


8rrĉ?c 
The names we have given to each of these contributions will become clearer as we look 
at their properties in more detail. 
Magnetic Dipole Radiation 


Recall that, for a general current distribution, the magnetic dipole m is defined by 


m = 5 [ee x x J(x’) 


The magnetic dipole contribution to radiation (6.22) can then be written as 


AMD (x, t) =- & x m(t- 
(x,t) = -è x a(t = r/c) 
This means that varying loops of current will also emit radiation. Once again, the 
leading order contribution to the magnetic field, B = V x A, arises when the curl hits 
the argument of m. We have 


Using the Maxwell equation EM? = c?V x BMP to compute the electric field, we have 


EMP (x,t) = es x ta(t — r/c) 


The end result is very similar to the expression for B and E that we saw in (6.15) and 
(6.16) for the electric dipole radiation. This means that the radiated power has the 
same angular form, with the Poynting vector now given by 


GMD _ eral’ sin? 02 (6.24) 
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Integrating over all space gives us the power emitted, 


Ho |... 


This takes the same form as the electric dipole result (6.18), but with the electric dipole 
replaced by the magnetic dipole. Notice, however, that for non-relativistic particles, the 
magnetic dipole radiation is substantially smaller than the electric dipole contribution. 
For a particle of charge Q, oscillating a distance d with frequency w, we have p ~ Qd 
and m ~ Qd?w. This means that the ratio of radiated powers is 


MD 2 2 2 
P dfw v 


where v is the speed of the particle. 


Electric Quadrupole Radiation 


The electric quadrupole tensor Q;; arises as the 1/r* term in the expansion of the 
electric field for a general, static charge distribution. It is defined by 


Qy = pee p(x’) (aut, = bij”) 


This is not quite of the right form to account for the contribution to the potential 
(6.23). Instead, we have 


ACM, i= ae (xid,¢ —rfe)+ n f de r’ pla’, t— r/o) 


24Tr?c 


The second term looks like a mess, but it doesn’t do anything. This is because it’s 
radial and so vanishes when we take the curl to compute the magnetic field. Neither 
does it contribute to the electric field which, in our case, we will again determine from 
the Maxwell equation. This means we are entitled to write 


H 
AFS (x,t) = Eo x- Q(t —r/c) 


where (x - Q); = x;Q;;. Correspondingly, the magnetic and electric fields at large 


distance are 


Lo R os 
BPS (x,t) x anra $ ” ($ - Q) 


a A ea (& Q- 2) - ($ ü) 


247rec 


We may again compute the Poynting vector and radiated power. The details depend on 
the exact structure of Q, but the angular dependence of the radiation is now different 
from that seen in the dipole cases. 
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Finally, you may wonder about the cross terms between the ED, MD and EQ com- 
ponents of the field strengths when computing the quadratic Poynting vector. It turns 
out that, courtesy of their different spatial structures, these cross-term vanish when 
computing the total integrated power. 


6.2.5 An Application: Pulsars 


Pulsars are lighthouses in the sky, spinning neutron stars continuously beaming out 
radiation which sweeps past our line of sight once every rotation. They have been 
observed with periods between 107? seconds and 8 seconds. 


Neutron stars typically carry a very large magnetic field. This arises from the parent 
star which, as it collapses, reduces in size by a factor of about 10°. This squeezes the 
magnetic flux lines, which gets multiplied by a factor of 10'°. The resulting magnetic 
field is typically around 10° Tesla, but can be as high as 10'! Tesla. For comparison, 
the highest magnetic field that we have succeeded in creating in a laboratory is a paltry 
100 Tesla or so. 


The simplest model of a pulsar has the resulting magnetic Q 
dipole moment m of the neutron star misaligned with the angular = 
velocity. This resulting magnetic dipole radiation creates the desired 


lighthouse effect. Consider the set-up shown in the picture. We take 
the pulsar to rotate about the z-axis with frequency Q. The magnetic 
moment sits at an angle a relative to the z-axis, so rotates as 


m = my (sin(a) sin(Qt)xX + sin(a) cos(Qt)y¥ + cos aĉ) Figure 55: 


The power emitted (6.25) is then 


At the surface of the neutron star, it’s reasonable to assume that the magnetic field is 
given by the dipole moment. In Section 3.3, we computed the magnetic field due to a 
dipole moment: it is 


Toa 


where R is the radius of the star. This means that Bmax = Homo/27R? and the power 
emitted is 
B 2r RÊ B? 


3 ua Of sin? a (6.26) 
0 
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Figure 56: A composite image of the Crab Nebula, taken by the Hubble, Chandra and 
Spitzer space telescopes. 


Because the pulsar is emitting radiation, it must lose energy. And this means it slows 
down. The rotational energy of a the pulsar is given by 


1 
E==10? 
2 
where I = 2M R? is the moment of inertia of a sphere of mass M and radius R. 


Equating the power emitted with the loss of rotational kinetic energy gives 
P =—E=—-I00 (6.27) 


Let’s put some big numbers into these equations. In 1054, Chinese astronomers saw a 
new star appear in the sky. 6500 light years away, a star had gone supernova. It left 
behind a pulsar which, today, emits large quantities of radiation, illuminating the part 
of the sky we call the Crab nebula. This is shown in the picture. 


The Crab pulsar has mass M ~ 1.4Mgun © 3 x 10°° kg and radius R ~ 15 km. It 
spins about 30 times a second, so Q & 607 s~!. It’s also seen to be slowing down with 
Q = —2 x 107° s~?. From this information alone, we can calculate that it loses energy 
at arate of E = INQ ~ —10°? Js~!. That’s a whopping amount of energy to be losing 
every second. In fact, it’s enough energy to light up the entire Crab nebula. Which, of 
course, it has to be! Moreover, we can use (6.26) and (6.27) to estimate the magnetic 
field on the surface of the pulsar. Plugging in the numbers give Bmax sina ~ 10° Tesla. 
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6.3 Scattering 


In this short section, we describe the application of our radiation formulae to the 
phenomenon of scattering. Here’s the set-up: an electromagnetic wave comes in and 
hits a particle. In response, the particle oscillates and, in doing so, radiates. This new 
radiation moves out in different directions from the incoming wave. This is the way 
that light is scattered. 


6.3.1 Thomson Scattering 


We start by considering free, charged particles where the process is known as Thomson 
scattering. The particles respond to an electric field by accelerating, as dictated by 
Newton’s law 


mx = qE 


i(kr—wt) To solve for the motion of the 


The incoming radiation takes the form E = Epe 
particle, we’re going to assume that it doesn’t move very far from its central position, 
which we can take to be the origin r = 0. Here, “not very far” means small compared 
to the wavelength of the electric field. In this case, we can replace the electric field by 


E ~ Eye“, and the particle undergoes simple harmonic motion 


E 
x(t) = a sin(wt) 


We should now check that the motion of the particle is indeed small compared to the 
wavelength of light. The maximum distance that the particle gets is £max = qEo/Mm%?, 
so our analysis will only be valid if we satisfy 

gin _ c qEo 


L > 2121 (6.28) 
mu W MUIC 


This requirement has a happy corollary, since it also ensures that the maximum speed 
of the particle Umax = qEo/mw < c, so the particle motion is non-relativistic. This 
means that we can use the dipole approximation to radiation that we developed in the 
previous section. We computed the time-averaged radiated power in (6.20): it is given 
by 


4 p2 

5 Log Eo 
Pra iated — 

diated  120m2c 


It’s often useful to compare the strength of the emitted radiation to that of the incoming 
radiation. The relevant quantity to describe the incoming radiation is the time-averaged 
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magnitude of the Poynting vector. Recall from Section 4.4 that the Poynting vector 
for a wave with wavevector k is 
1 F2 + 
S=—ExB=—%K sin?(k - x — wt) 
Ho Ho 
Taking the time average over a single period, T = 27/w, gives us the average energy 
flux of the incoming radiation, 


» ck? 
Dineident a 
2/0 


with the factor of two coming from the averaging. The ratio of the outgoing to incoming 
powers is called the cross-section for scattering. It is given by 


D 2 4 

ee Piadiated = Lod 
— Q 22 
Sincident 6mm? c 


The cross-section has the dimensions of area. To highlight this, it’s useful to write it 
as 
87 


o= a (6.29) 


where the length scale r4 is known as the classical radius of the particle and is given by 


g 


AT EQN g 


= me 


This equation tells us how to think of rz. Up to some numerical factors, it equates 
the Coulomb energy of a particle in a ball of size rg with its relativistic rest mass. 
Ultimately, this is not the right way to think of the size of point particles. (The right 
way involves quantum mechanics). But it is a useful concept in the classical world. For 
the electron, re © 2.8 x 107 m. 


The Thompson cross-section (6.29) is slightly smaller than the (classical) geometric 
cross-section of the particle (which would be the area of the disc, Anr?). For us, the 
most important point is that the cross-section does not depend on the frequency of 
the incident light. It means that all wavelengths of light are scattered equally by 
free, charged particles, at least within the regime of validity (6.28). For electrons, the 
Thomson cross-section is o œ% 6 x 107% m?. 
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Figure 57: Now you know why. 


6.3.2 Rayleigh Scattering 


Rayleigh scattering describes the scattering of light off a neutral atom or molecule. 
Unlike in the case of Thomson scattering, the centre of mass of the atom does not 
accelerate. Instead, as we will see in Section 7.1.1, the atom undergoes polarisation 


p=akE 


We will present a simple atomic model to compute the proportionality constant in 
Section 7.5.1, where we will show that it takes the form (7.29), 


B q/m 
—w? + we — iyw 


Here wo is the natural oscillation frequency of the atom while w is the frequency of 
incoming light. For many cases of interest (such as visible light scattering off molecules 
in the atmosphere), we have wọ >> w, and we can approximate a as a constant, 


i 


2 
wm 


We can now compute the time-average power radiated in this case. It’s best to use 
the version of Larmor’s fomula involving the electric dipole (6.19), since we can just 
substitute in the results above. We have 


p joa? Fu 
radiated = 
127c 
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In this case, the cross-section for Rayleigh scattering is given by 


D 2,4 4 2 4 
pe P radiated — Hoad ( wW ) = Snr ( wW ) 
= = = 33 = 
Sincident 6nrm?c Wo 3 Wo 


We see that the cross-section now has more structure. It increases for high frequencies, 


o ~ wt or, equivalently, for short wavelengths o ~ 1/\*. This is important. The most 
famous example is the colour of the sky. Nitrogen and oxygen in the atmosphere scatter 
short-wavelength blue light more than the long-wavelength red light. This means that 
the blue light from the Sun gets scattered many times and so appears to come from all 
regions of the sky. In contrast, the longer wavelength red and yellow light gets scattered 
less, which is why the Sun appears to be yellow. (In the absence of an atmosphere, the 
light from the Sun would be more or less white). This effect is particularly apparent at 
sunset, when the light from the Sun passes through a much larger slice of atmosphere 
and, correspondingly, much more of the blue light is scattered, leaving behind only red. 


6.4 Radiation From a Single Particle 


In the previous section, we have developed the multipole expansion for radiation emitted 
from a source. We needed to invoke a couple of approximations. First, we assumed 
that we were far from the source. Second, we assumed that the motion of charges and 
currents within the source was non-relativistic. 


In this section, we’re going to develop a formalism which does not rely on these 
approximations. We will determine the field generated by a particle with charge q, 
moving on an arbitrary trajectory r(r), with velocity v(t) and acceleration a(t). It 
won’t matter how far we are from the particle; it won’t matter how fast the particle is 
moving. The particle has charge density 


p(x, t) =g (x — r(t)) (6.30) 
and current 
J(x, t) = qv(t)o°(x — r(t)) (6.31) 


Our goal is find the general solution to the Maxwell equations by substituting these 
expressions into the solution (6.7) for the retarded potential, 


i tre 
At) = E f Pa! rtre) (6.32) 


|x = x'| 


The result is known as Liénard-Wierchert potentials. 
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6.4.1 Liénard-Wierchert Potentials 


If we simply plug (6.30) into the expression for the retarded electric potential (6.32), 
we get 


(xt) = gh fata’ 5 2 gË — Flt) 
Here we’re denoting the position of the particle as r(t), while we’re interested in the 
value of the electric potential at some different point x which tdoes not lie on the 
trajectory r(t). We can use the delta-function to do the spatial integral, but it’s a little 
cumbersome because the x’ appears in the argument of the delta-function both in the 
obvious place, and also in tret = t — |x — x’|/c. It turns out to be useful to shift this 
awkwardness into a slightly different delta-function over time. We write, 


/ i / 1 f / t 
bet) f dt 1 Pa Bl A — tra) 
j 1 


q 
Are 
2 5(t =t- |x —r(t’)|/o) (6.33) 


dt 
ATEo / |x — r(t’)| 
We still have the same issue in doing the f dt’ integral, with ¢’ appearing in two places 


in the argument. But it’s more straightforward to see how to deal with it. We introduce 
the separation vector 


R(t) =x—-r(t) 


Then, if we define f(t’) = t + R(t’)/c, we can write 


ost) = ge fae aaa se) 


= q dt’ 1 = dees 
Amey fa df R(t’) a(t — fe) 
q E 1 | 

oF REN] reya 


ÅT Eo 


A quick calculation gives 


df R(t!) v(t) 
ee a 
dt! c 
with v(t) = r(t) = —R(t). This leaves us with our final expression for the scalar 
potential 
o(x,t) = 4 : . (6.34) 
dre [c= RE) vE) ROJ a | 
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Exactly the same set of manipulations will give us a similar expression for the vector 
potential, 


(6.35) 


Equations (6.34) and (6.35) are the Liénard-Wierchert potentials. In both expressions 
“ret” stands for “retarded” and means that they should be evaluated at time t deter- 
mined by the requirement that 


U+R(t')/c=t (6.36) 


This equation has an intuitive explanation. If you trace back 
light-sheets from the point x, they intersect the trajectory of 
the particle at time t’, as shown in the figure. The Liénard- 
Wierchert potentials are telling us that the field at point x 


is determined by what the particle was doing at this time t’. 


6.4.2 A Simple Example: A Particle Moving with 
Constant Velocity 


Figure 58: 


The Liénard-Wierchert potentials (6.34) and (6.35) have 
the same basic structure that we for the Coulomb law in electrostatics and the Biot- 
Savart law in magnetostatics. The difference lies in the need to evaluate the potentials 
at time t'. But there is also the extra factor 1/(1 — Ê- v/c). To get a feel for this, let’s 
look at a simple example. We’ll take a particle which moves at constant speed in the 
z direction, so that 


r(t) =t? => wit) 


To simplify life even further, we’ll compute the potentials at a point in the z = 0 plane, 
so that x = (x,y,0). We’ll ask how the fields change as the particle passes through. 
The equation (6.36) to determine the retarded time becomes 


t+ Vfxrt+y?+v2/ce=t 


Squaring this equation (after first making the right-hand side t—t’) gives us a quadratic 
in t’, 
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where we see the factor y = (1 — v?/c?)~'/?, familiar from special relativity naturally 
emerging. The quadratic has two roots. We’re interested in the one with the minus 
sign, corresponding to the retarded time. This is 


2 
(at CVPR (6.37) 


We now need to deal with the various factors in the numerator of the Liénard-Wierchert 
potential (6.34). Pleasingly, they combine together nicely. We have R(t’) = c(t — t’). 
Meanwhile, R(t’) - v(t’) = (x —r(t’)) -v = —r(t’) -v = —v’t since we’ve taken x to lie 


perpendicular to v. Put together, this gives us 


q 1 1 
Ox!) = 7+ web n 8) 
q 1 
 Ameg c(t — t) + vt! 
1 1 


Arey c(t — t'/y?) 


But, using our solution (6.37), this becomes 


q 1 
t)= 
O(X, ) 4TEo [v2t2 4 (x2 + y?) /y2]1/2 


Similarly, the vector potential is 


qHo ¥ 
A(x, t) = 
D= ia PEF ET 


How should we interpret these results? The distance from the particle to the point x is 
r? = x? +y? +vt?. The potentials look very close to those due to a particle a distance 
r away, but with one difference: there is a contraction in the x and y directions. Of 
course, we know very well what this means: it is the usual Lorentz contraction in special 
relativity. 


In fact, we previously derived the expression for the electric and magnetic field of a 
moving particle in Section 5.3.4, simply by acting with a Lorentz boost on the static 
fields. The calculation here was somewhat more involved, but it didn’t assume any rel- 
ativity. Instead, the Lorentz contraction follows only by solving the Maxwell equations. 
Historically, this kind of calculation is how Lorentz first encountered his contractions. 
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6.4.3 Computing the Electric and Magnetic Fields 


We now compute the electric and magnetic fields due to a particle undergoing arbitrary 
motion. In principle this is straightforward: we just need to take our equations (6.34) 
and (6.35) 


(x,t) = EA = R(t’) - v(t") R(t’) i 
_ 4Ho i ve) 
A(x,t) = An = R(t’) - v(t) RE) ret 


where R(t’) = x — r(t’). We then plug these into the standard expressions for the 
electric field E = —V@ — OA/Ot and the magnetic field B = V x A. However, in 
practice, this is a little fiddly. It’s because the terms in these equations are evaluated 
at the retarded time t’ determined by the equation t + R(t’)/c = t. This means that 
when we differentiate (either by 0/0t or by V), the retarded time also changes and 
so gives a contribution. It turns out that it’s actually simpler to return to our earlier 
expression (6.33), 


gxt) = gh J dt! an St- t — R(t’)/c) 


and a similar expression for the vector potential, 


t 
A(x,t) = m / dt sev — Ree /c) (6.38) 
This will turn out to be marginally easier to deal with. 


The Electric Field 


We start with the electric field E = —Vọ — OA/Ot. We call the argument of the 
delta-function 


s=t—t'— R(t’) 


We then have 


v= oh fae |-Tos) - 5000S] 


R? R 
o 4 I Pi at VR VR 
ATE Os 


-pz ols) = Reo () 


(6.39) 
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The Jacobian factor from changing the integral variable is the given by 


Os 


a = -1+ RU) -vie')/c 


This quantity will appear a lot in what follows, so we give it a new name. We define 
K=1-R(t’)-v(t’)/c 


so that Ot'/0s = —1/k. Integrating the second term in (6.39) by parts, we can then 


o q VR d (VR 
MS Arey i | kR? ds (= one) 
o q VR 1d /VR 
Arey Á | kR? kdt VAC 1) 
Meanwhile, the vector potential term gives 


OA — to f ay Voy 98 
Se ale Rm Oa 


write 


But 0s/0t = 1. Moving forward, we have 


OA Go d Ot 


Ə 4r |Jg 


_ _ gto ES Ai 
An É (=) a(s) 
-w fas 2 | 7 ( 

4T K 


Putting this together, we get 


a fa r ld (Ejo 


ATEO KR? «edt! KR 
R ld R -v/c 
kR? kcdt KR 


We're still left with some calculations to do. Specifically, we need to take the derivative 


= 9 
ATE 


(6.40) 


ret 


d/dt’. This involves a couple of small steps. First, 


dR d/R v Re 1 a 
THE) | (R-v) = -7 (v-(v-R)R) 


= [62's 


Also, 


d d 


=en = — = % = — š 3 2 — 2 
ap CP) TE (R-R-v/c) v-R+v*/c—R-a/c 


Putting these together, we get 


(Soe) =- (v = RJR) z he + Sai (v:-R-v?/c+R-a/c) 


We write the v - R terms as v-R = c(1— K). Then, expanding this out, we find that 
a bunch of terms cancel, until we’re left with 


d (AS) 7 R 4 c(R — v/c) 


(1 —v?/c?) + |(R - v/c)R-a— xa 


dt! KR R? kK? R? k? Rc 
c c(R-—v/c) Rx [|(R-— v/c) xa] 
= — .41 
R? a 2K? R? i K? Rce Pa 


where we’ve introduced the usual y factor from special relativity: y~? = 1/(1 — v?/c’). 
Now we can plug this into (6.40) to find our ultimate expression for the electric field, 


BN- i Rx [(R-v/o x | en 

gala R? Kee a 
Since it’s been a long journey, let’s recall what everything in this expression means. 
The particle traces out a trajectory r(t), while we sit at some position x which is 
where the electric field is evaluated. The vector R(t) is the difference: R = x —r. 
The ret subscript means that we evaluate everything in the square brackets at time t’, 
determined by the condition t + R(t’)/c = t. Finally, 


A 


k=1— 


1 
and =? = — 
c 4 1 — v?/c? 


The electric field (6.42) has two terms. 
e The first term drops off as 1/R?. This is what becomes of the usual Coulomb 
field. It can be thought of as the part of the electric field that remains bound to 


the particle. The fact that it is proportional to R, with a slight off-set from the 
velocity, means that it is roughly isotropic. 


e The second term drops off as 1/R and is proportional to the acceleration. This 
describes the radiation emitted by the particle. Its dependence on the acceleration 
means that it’s highly directional. 
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The Magnetic Field 


To compute the magnetic field, we start with the expression (6.38), 


A(x) = ef at Se 


with s =t—t — R(t’)/c. Then, using similar manipulations to those above, we have 


VR Vs xv 
R 


— _ Go / ata A $ 
B=VxA Ho fat | TA x vô(s) + o) 


qHo VR ld /(VRxv 
—— ——— — A 
An fo | RR YT kde KRec ale) eee) 
We've already done the hard work necessary to compute this time derivative. We can 
write, 

d (VRxv\_ d (R—v/c)xv 

dt’ KR dt! KR 


d (AS) eg a a 


~ dt! KR KR 


Now we can use (6.41). A little algebra shows that terms of the form v x a cancel, and 
we're left with 


dt’ 


d {Rxv 7 Rxv Rxv (R-a)Rxv_ Rxa 
KR 7 R? | KRR ck? R? © KR 


Substituting this into (6.43), a little re-arranging of the terms gives us our final expres- 
sion for the magnetic field, 


Rxv (R-a(Rxv/)+sRxa 
KBR? | ck? R 


qHo 


B=- 
At 


(6.44) 


ret 

We see that this has a similar form to the electric field (6.42). The first term falls off as 
1/R? and is bound to the particle. It vanishes when v = 0 which tells us that a charged 
particle only gives rise to a magnetic field when it moves. The second term falls off as 
1/R. This is generated by the acceleration and describes the radiation emitted by the 
particle. You can check that E in (6.42) and B in (6.44) are related through 

B= [Ree xE (6.45) 


as you might expect. 
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6.4.4 A Covariant Formalism for Radiation 


Before we make use of the Liénard-Wierchert potentials, we’re going to do something 
a little odd: we’re going to derive them again. This time, however, we’ll make use of 
the Lorentz invariant notation of electromagnetism. This won’t teach us anything new 
about physics and the results of this section aren’t needed for what follows. But it will 
give us some practice on manipulating these covariant quantities. Moreover, the final 
result will be pleasingly concise. 


A Covariant Retarded Potential 


We start with our expression for the retarded potential (6.32) in terms of the current, 
Ho Baal Ju(X', tret) 

A, (x,t) = — | da’ ———~ 6.46 

a ) Ar f |x = x’ | ( ) 

with tret = t—|x—x’|/c. This has been the key formula that we’ve used throughout this 

section. Because it was derived from the Maxwell equations, this formula should be 

Lorentz covariant, meaning that someone in a different inertial frame will write down 

the same equation. Although this should be true, it’s not at all obvious from the way 

that (6.46) is written that it actually is true. The equation involves only integration 

over space, and the denominator depends only on the spatial distance between two 
points. Neither of these are concepts that different observers agree upon. 


So our first task is to rewrite (6.46) in a way which is manifestly Lorentz covariant. 
To do this, we work with four-vectors X” = (ct,x) and take a quantity which everyone 
agrees upon: the spacetime distance between two points 


(X = XP = hul X" = XX” =X") = A= = [x - xP 


Consider the delta-function 6((X — X’)?), which is non-vanishing only when X and 
X’ are null-separated. This is a Lorentz-invariant object. Let’s see what it looks like 
when written in terms of the time coordinate t. We will need the general result for 
delta-functions 

d(x — xi) 


d(f(x)) = ` T (6.47) 


where the sum is over all roots f(x;) = 0. Using this, we can write 


5 (X = XY) = 4 (leU — t) + [x — x'e — t) — [x — x) 


Ti 


_ ôl eb ae x'|) ål — ct- |x- x']) 

7 Qclt — t'l 2Qclt — t'l 

= ôf ai ee |): ål — ct- |x- x']) 
2|x — x’| 2|x — x’| 
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The argument of the first delta-function is ct’ — Cty, and so this term contributes only 
if t < t. The argument of the second delta-function is ct’ — ctagy and so this term can 
contribute only contribute if t > t. But the temporal ordering of two spacetime points 
is also something all observers agree upon, as long as those points are either timelike 
or null separated. And here the delta-function requires the points to be null separated. 
This means that if we picked just one of these terms, that choice would be Lorentz 
invariant. Mathematically, we do this using the Heaviside step-function 


We have 


d(ct! — ctret) 


6((X -X’))O(¢-t) = axa x! 


(6.48) 


The left-hand side is manifestly Lorentz invariant. The right-hand side doesn’t look 
Lorentz invariant, but this formula tells us that it must be! Now we can make use of 
this to rewrite (6.46) in a way that the Lorentz covariance is obvious. It is 


A,(X) = 5 f d*X" J,(X") 6 ((X — X’) @(t-#) (6.49) 


where the integration is now over spacetime, d*X’ = cdt dèx’. The combination of 
the delta-function and step-functions ensure that this integration is limited to the past 
light-cone of a point. 

A Covariant Current 


Next, we want a covariant expression for the current formed by a moving charged 
particle. We saw earlier that a particle tracing out a trajectory y(t) gives rise to a 
charge density (6.30) and current (6.31) given by 


p(x, t)=q0(x-y(t)) and I(x, t) =qv(t) (x — y(t) (6.50) 


(We’ve changed notation from r(t) to y(t) to denote the trajectory of the particle). 
How can we write this in a manifestly covariant form? 


We know from our course on Special Relativity that the best way to parametrise the 
worldline of a particle is by using its proper time 7. We’ll take the particle to have 
trajectory Y“(r) = (ct(rT), y(7)). Then the covariant form of the current is 


JX) = ac | dr Y#(T) (X — Y”(T)) (6.51) 
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It’s not obvious that (6.51) is the same as (6.50). To see that it is, we can decompose 
the delta-function as 


X — Y"(r)) = d(ct — ¥°(7)) F(x — y(7)) 


The first factor allows us to do the integral over dr, but at the expense of picking up 
a Jacobian-like factor 1/Y° from (6.47). We have 


which does give us back the same expressions (6.50). 


Covariant Liénard-Wierchert Potentials 


We can now combine (6.49) and (6.51) to get the retarded potential, 


(ct! — ctret) 
Ix — x'| 


w(x) = BE f dt x! I dr Y"(r) Aar) 


6(ct — Y°(r) — |x - y(7)| 
Ix — y(7)| 


= EE | ar YH(r) 


This remaining delta-function implicitly allows us to do the integral over proper time. 
Using (6.48) we can rewrite it as 


d(ct — Y°(r) — |x - y(7)I) 
2|x — y(7)| 


= 6(R(r) - R(r)) O(R%(r)) (6.52) 
where we’re introduced the separation 4-vector 
R= X¥=Y"(F) 


The delta-function and step-function in (6.52) pick out a unique value of the proper 
time that contributes to the gauge potential at point X. We call this proper time 7,. 
It is the retarded time lying along a null direction, R(7,) - R(7,) = 0. This should be 
thought of as the proper time version of our previous formula (6.36). 


The form (6.52) allows us to do the integral over T. But we still pick up a Jacobian- 
like factor from (6.47). This gives 


SRC) - Rr) (RO) = 
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Putting all of this together gives our covariant form for the Lienard-Wierchert potential, 


A#(X) = Hoge YG) 


Ar |RY(7.)¥L(7)| 


This is our promised, compact expression. Expanding it out will give the previous 
results for the scalar (6.34) and vector (6.35) potentials. (To see this, you’ll need to 
first show that |R’(7,)Y’(7.)| = Y(T) R(T — R(t) - v(%4)/c).) 


The next step is to compute the field strength F y = 0,A, — ô Ap. This is what 
took us some time in Section 6.4.3. It turns out to be somewhat easier in the covariant 
approach. We need to remember that 7, is a function of X”. Then, we get 


_ Hoge YAn) On Y(t) IR(A) \ _ k 
(= ) (u v) (6.53) 


i= - 
E Ar 


(TYT) OXE |Re(r)Y,(r)2 OX" 


The simplest way to compute 07,/0X™" is to start with 7,,R°(7,)R° (7) = 0. Differen- 
tiating gives 


too RE (74) 8," (Ta) = Npo R(T.) (8 — Y° (Ta) Opts) = 0 


Rearranging gives 


For the other term, we have 


OR? (14) Yo (7) | 
OX 


= (öz — Ynja) Yo (Ta) + Ro (tT) Yo( Te) Opts 


= (Rei + 2) Outer Tae) 


where we’ve used rey, = œ. Using these in (6.53), we get our final expression for the 
field strength, 


F Ý, = Ý, Ý, vy Y, 
(=e + RY) FY, Xulty a (6.54) 


ee (R°Y,)2 RÝ, 


4r RY, 


This is the covariant field strength. It takes a little work to write this in terms of the 
component E and B fields but the final answer is, of course, given by (6.42) and (6.44) 
that we derived previously. Indeed, you can see the general structure in (6.54). The 
first term is proportional to velocity and goes as 1/R?; the second term is proportional 
to acceleration and goes as 1/R. 
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6.4.5 Bremsstrahlung, Cyclotron and Synchrotron Radiation 


To end our discussion, we derive the radiation due to some simple relativistic motion. 


Power Radiated Again: Relativistic Larmor Formula 


In Section 6.2.2, we derived the Larmor formula for the emitted power in the electric 
dipole approximation to radiation. In this section, we present the full, relativistic 
version of this formula. 


We’ll work with the expressions for the radiation fields E (6.42) and B (6.44). As 
previously, we consider only the radiative part of the electric and magnetic fields which 
drops off as 1/R. The Poynting vector is 

1 1 a 1 i 
S = —E x B= —E x (Rx E) = —|E/’R 
Ho Moc Moc 
where all of these expressions are to be computed at the retarded time. The second 
equality follows from the relation (6.45), while the final equality follows because the 
radiative part of the electric field (6.42) is perpendicular to R. Using the expression 
(6.42), we have 


_— x[R-v/c)x all R 
1672 9c3 Kê R? 


with k = 1 — R- v/c. 


Recall that everything in the formula above is evaluated at the retarded time t’, 
defined by t + R(t')/c = t. This means, that the coordinates are set up so that we can 
integrate S over a sphere of radius R that surrounds the particle at its retarded time. 
However, there is a subtlety in computing the emitted power, associated to the Doppler 
effect. The energy emitted per unit time ¢ is not the same as the energy emitted per 
unit time t. They differ by the factor dt/dt’ = k. The power emitted per unit time t’, 
per solid angle dQ, is 

dP g@ |É x [(È- v/o x al]? 


— = KkRS. R= 


7 6.55 
dQ 1677 ¢9c¢? K5 ( ) 


To compute the emitted power, we must integrate this expression over the sphere. This 
is somewhat tedious. The result is given by 


q? x2 
P= ñ| + (v-a) (6.56) 
6Tregoc? ce 
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This is the relativistic version of the Larmor formula (6.18). (There is a factor of 2 
difference when compared to (6.20) because the former equation was time averaged). 
We now apply this to some simple examples. 


Bremsstrahlung 


Suppose a particle is travelling in a straight line, with velocity v parallel to acceleration 
a. The most common situation of this type occurs when a particle decelerates. In this 
case, the emitted radiation is called bremsstrahlung, German for “braking radiation”. 


We'll sit at some point x, at which the radiation reaches us from the retarded point 
on the particle’s trajectory r(t’). As before, we define R(t’) = x — r(t’). We introduce 
the angle 0, defined by 


A 


R-v=vcosd 


Because the v x a term in (6.55) vanishes, the angular dependence of the radiation is 
rather simple in this case. It is given by 


dP ea? sin? 0 


dQ 16r2eoc? (1 — (v/c) cos 6)5 


For v < c, the radiation is largest in the direction 0 ~ 7/2, perpendicular to the 
direction of travel. But, at relativistic speeds, v — c, the radiation is beamed in the 
forward direction in two lobes, one on either side of the particle’s trajectory. The total 
power emitted is (6.56) which, in this case, simplifies to 


= @eya2 
67 €Egc3 
Cyclotron and Synchrotron Radiation 


Suppose that the particle travels in a circle, with v -a = 0. We’ll pick axes so that a 
is aligned with the x-axis and v is aligned with the z-axis. Then we write 


A 


R = sin @ cos $x + sin 0 sin gy + cos 02 


After a little algebra, we find that the angular dependence of the emitted radiation is 


dP a? 1 ( sin? 0 cos? o ) 


d 167eqc3 (1 — (v/c) cos)? \-¥2(1 — (v/c) cos 6)? 


At non-relativistic speeds, v < c, the angular dependence takes the somewhat simpler 
form (1 — sin? 0 cos? ¢). In this limit, the radiation is referred to as cyclotron radiation. 


== 


In contrast, in the relativistic limit v > c, the radiation is again beamed mostly in the 
forwards direction. This limit is referred to as synchrotron radiation. The total emitted 
power (6.56) is this time given by 
E q?ya? 
6Teoc’ 


Note that the factors of y differ from the case of linear acceleration. 


St = 


7. Electromagnetism in Matter 


Until now, we’ve focussed exclusively on electric and magnetic fields in vacuum. We end 
this course by describing the behaviour of electric and magnetic fields inside materials, 
whether solids, liquids or gases. 


The materials that we would like to discuss are insulators which, in this context, are 
usually called dielectrics. These materials are the opposite of conductors: they don’t 
have any charges that are free to move around. Moreover, they are typically neutral so 
that — at least when averaged — the charge density vanishes: p = 0. You might think 
that such neutral materials can’t have too much effect on electric and magnetic fields. 
But, as we will see, things are more subtle and interesting. 


7.1 Electric Fields in Matter 
The fate of electric fields inside a dielectric depends O © © 

on the microscopic make-up of the material. We going to Q Q 

work only with the simplest models. We’ll consider our & Q (e) © (e) 


material to be constructed from a lattice of neutral atoms. 


Each of these atoms consists of a positively charged nuclei, © © © (e) (*) 


surrounded by a negatively charged cloud of electrons. A 


cartoon of this is shown in the figure; the nucleus is drawn Figure 59: A simple model 


in red, the cloud of electrons in yellow. gt a neutral material 

Suppose that electric field E is applied to this material. What happens? Although 
each atom is neutral, its individual parts are not. This results in an effect called 
polarisation: the positively charged nucleus gets pushed a little in the direction of E; 
the negatively charged cloud gets pushed a little in the opposite direction. (This is 
not to be confused with the orientation of the electromagnetic wave which also has the 
name “polarisation” ). 


The net effect is that the neutral atom gains an electric dipole moment. Recall from 
Section 2 that two equal and opposite charges, +q and —q, separated by a distance d, 
have an electric dipole p = qd. By convention, p points from the negative charge to 
the positive charge. 


It turns out that in most materials, the induced electric dipole is proportional to the 
electric field, 


p=aE (7.1) 


= ia 
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Figure 60: The polarisation of an atom 


The proportionality factor œ is called the atomic polarisability. Because p points from 
negative to positive charge, it points in the same direction as E. The electric field 
will also result in higher multipole moments of the atoms. (For example, the cloud of 
electrons will be distorted). We will ignore these effects. 


A Simple Model for Atomic Polarisability 


Here’s a simple model which illustrates how the relationship (7.1) arises. It also gives 
a ball-park figure for the value of the atomic polarisability œ. Consider a nucleus of 
charge +q, surrounded by a spherical cloud of electrons of radius a. We’ll take this 
cloud to have uniform charge density. If we just focus on the electron cloud for now, the 
electric field it produces was computed in Section 2: it rises linearly inside the cloud, 
before dropping off as 1/r? outside the cloud. Here we’re interested in the linearly 
increasing behaviour inside 


1 qr. 


Edoud = (r < a) (7.2) 


— =f 
Ate, a 
In the absence of an external field, the nucleus feels the field due to the cloud and sits 
at r = 0. Now apply an external electric field E. The nucleus will be displaced to sit 
at a point where E + Egioug = 0. In other words, it will be displaced by 
Atrega® 
r= —— 
q 


This gives the simple expression a = 47¢€9a?. This isn’t too far off the experimentally 


E > p=gr=4rea°E 


measured values. For example, for hydrogen a/47€9 ~ 0.7 x 10730 m? which, from the 
above formula, suggests that the size of the cloud is around a ~ 1071! m. 


7.1.1 Polarisation 


We’ve learnt that applying an electric field to a material causes each atom to pick up 
a dipole moment. We say that the material is polarised. The polarisation P is defined 


=i 


to be the average dipole moment per unit volume. If n is the density of atoms, each 
with dipole moment p, then we can write 


P = np (7.3) 


We’ve actually dodged a bullet in writing this simple equation and evaded a subtle, but 
important, point. Let me try to explain. Viewed as a function of spatial position, the 
dipole moment p(r) is ridiculously complicated, varying wildly on distances comparable 
to the atomic scale. We really couldn’t care less about any of this. We just want the 
average dipole moment, and that’s what the equation above captures. But we do care 
if the average dipole moment varies over large, macroscopic distances. For example, the 
density n may be larger in some parts of the solid than others. And, as we’ll see, this 
is going to give important, physical effects. This means that we don’t want to take the 
average of p(r) over the whole solid since this would wash out all variations. Instead, 
we just want to average over small distances, blurring out any atomic messiness, but 
still allowing P to depend on r over large scales. The equation P = np is supposed to 
be shorthand for all of this. Needless to say, we could do a better job of defining P if 
forced to, but it won’t be necessary in what follows. 


The polarisation of neutral atoms is not the only way that materials can become 
polarised. One simple example is water. Each H2O molecule already carries a dipole 
moment. (The oxygen atom carries a net negative charge, with each hydrogen carrying 
a positive charge). However, usually these molecules are jumbled up in water, each 
pointing in a different direction so that the dipole moments cancel out and the polari- 
sation is P = 0. This changes if we apply an electric field. Now the dipoles all want to 
align with the electric field, again leading to a polarisation. 


In general, the polarisation P can be a complicated function of the electric field E. 
However, most materials it turns out that P is proportional to E. Such materials are 
called linear dielectrics. They have 


P = e9x-E (7.4) 


where xe is called the electric susceptibility. It is always positive: Xe > 0. Our simple 
minded computation of atomic polarisability above gave such a linear relationship, with 
€oXe = na. 


The reason why most materials are linear dielectrics follows from some simple di- 
mensional analysis. Any function that has P(E = 0) = 0 can be Taylor expanded as a 
linear term + quadratic + cubic and so on. For suitably small electric fields, the linear 
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term always dominates. But how small is small? 'To determine when the quadratic 
and higher order terms become important, we need to know the relevant scale in the 
problem. For us, this is the scale of electric fields inside the atom. But these are huge. 
In most situations, the applied electric field leading to the polarisation is a tiny per- 
turbation and the linear term dominates. Nonetheless, from this discussion it should 
be clear that we do expect the linearity to fail for suitably high electric fields. 


There are other exceptions to linear dielectrics. Perhaps the most striking exception 
are materials for which P ¥ 0 even in the absence of an electric field. Such materials 
— which are not particularly common — are called ferroelectric. For what it’s worth, an 
example is BaTiO}. 


Bound Charge 


Whatever the cause, when a material is po- 
larised there will be regions in which there is a 


charge to emphasise the fact that it’s not allowed 
to move and is arising from polarisation effects. 


build up of electric charge. This is called bound Co 


Let’s illustrate this with a simple example before 


we describe the general case. Let’s go back to our 
lattice of neutral atoms. As we’ve seen, in the pres- Figure 61: A polarised material 
ence of an electric field they become polarised, as 

shown in the figure. However, as long as the polarisation is uniform, so P is constant, 
there is no net charge in the middle of the material: averaged over many atoms, the 
total charge remains the same. The only place that there is a net build up of charge 
is on the surface. In contrast, if P(r) is not constant, there will also be regions in the 
middle that have excess electric charge. 


To describe this, recall that the electric potential due to each dipole p is 


1 p-r 
r) = ——_—— 
d 4Treo r3 
(We computed this in Section 2). Integrating over all these dipoles, we can write the 
potential in terms of the polarisation, 


1 J P a 
V 


ÅT Eo 


o(r) 
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We then have the following manipulations., 


1 3 I / 1 1 
E N P . 
DSR. dS. P(r) 1 [or V- P(r’) 
ATEo Jg Ir—r’|  4reo Jy lr — r’| 


where S is the boundary of V. But both of these terms have a very natural interpre- 
tation. The first is the kind of potential that we would get from a surface charge, 


Obound = P-n 
where n is the normal to the surface S. The second term is the kind of potential that 
we would get from a charge density of the form, 


Pvouna (£) = —V - P(r) (7.5) 


This matches our intuition above. If the polarisation P is constant then we only find 
a surface charge. But if P varies throughout the material then this can lead to non- 
vanishing charge density sitting inside the material. 


7.1.2 Electric Displacement 
We learned in our first course that the electric field obeys Gauss’ law 


Y.E= 
E0 

This is a fundamental law of Nature. It doesn’t change just because we’re inside a 
material. But, from our discussion above, we see that there’s a natural way to separate 
the electric charge into two different types. There is the bound charge ppounag that 
arises due to polarisation. And then there is anything else. This could be some electric 
impurities that are stuck in the dielectric, or it could be charge that is free to move 
because our insulator wasn’t quite as good an insulator as we originally assumed. The 
only important thing is that this other charge does not arise due to polarisation. We 
call this extra charge free charge, Pfree. Gauss’ law reads 


1 
TESS 
€0 

1 
= (Ptree - V. P) 
€0 


Pfree + Pbound) 


We define the electric displacement, 


D=eE+P (7.6) 
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This obeys 
V - D = Pree (7.7) 


That’s quite nice. Gauss’ law for the displacement involves only the free charge; any 
bound charge arising from polarisation has been absorbed into the definition of D. 


For linear dielectrics, the polarisation is given by (7.4) and the displacement is pro- 
portional to the electric field. We write 


D=cE 


where € = €9(1+-) is the called the permittivity of the material. We see that, for linear 
dielectrics, things are rather simple: all we have to do is replace €9 with € everywhere. 
Because € > €o, it means that the electric field will be decreased. We say that it is 
screened by the bound charge. The amount by which the electric field is reduced is 
given by the dimensionless relative permittivity or dielectric constant, 


€ 
€ =—=1+ Xe 
€0 
For gases, €, is very close to 1. (It differs at one part in 10~° or less). For water, 
Er & 80. 


An Example: A Dielectric Sphere 


As a simple example, consider a sphere of dielectric material of radius R. We’ll place 
a charge Q at the centre. This gives rise to an electric field which polarises the sphere 
and creates bound charge. We want to understand the resulting electric field E and 
electric displacement D. 


The modified Gauss’ law (7.7) allows us to easily compute D using the same kind of 
methods that we used in Section 2. We have 


Q . 


= —_f 
Arr? 


(r< R) 


where the condition r < R means that this holds inside the dielectric. The electric field 
is then given by 


Q o Qe. 


4rer? 4Teor? 


(r < R) (7.8) 


AT 


This is what we’d expect from a charge Q/e, placed at the 
origin. The interpretation of this is that there is the bound 


charge gathers at the origin, screening the original charge is | a 

Q. This bound charge is shown as the yellow ring in the 

figure surrounding the original charge in red. The amount <~ — 

of bound charge is simply the difference S 
A 
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This bound charge came from the polarisation of the sphere. 


But the sphere is a neutral object which means that total Figure 62: A polarised ma- 


charge on it has to be zero. To accomplish this, there must terial 
be an equal, but opposite, charge on the surface of the sphere. This is shown as the 


red rim in the figure. This surface charge is given by 


€ — 1 


Q 


2 2. 
4rR Obound = = round = P 
r 


We know from our first course that such a surface charge will lead to a discontinuity 
in the electric field. And that’s exactly what happens. Inside the sphere, the electric 
field is given by (7.8). Meanwhile outside the sphere, Gauss’ law knows nothing about 
the intricacies of polarisation and we get the usual electric field due to a charge Q, 


Q 


— —_~ f o> 
Ategr? ( ) 
At the surface r = R there is a discontinuity, 


A a Q Q Obound 
E-?|, —E-#|_ = = = 
tl îl 4negR? 4reR? €o 


which is precisely the expected discontinuity due to surface charge. 


7.2 Magnetic Fields in Matter 


Electric fields are created by charges; magnetic fields are created by currents. We 
learned in our first course that the simplest way to characterise any localised current 
distribution is through a magnetic dipole moment m. For example, a current J moving 
in a planar loop of area A with normal ñ has magnetic dipole moment, 


m = I Añ 


The resulting long-distance gauge field and magnetic field are 


an=- EI = ge- f (3mm) 


4r r3 An r3 
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The basic idea of this section is that current loops, and their associated dipole moments, 
already exist inside materials. They arise through two mechanisms: 


e Electrons orbiting the nucleus carry angular momentum and act as magnetic 
dipole moments. 


e Electrons carry an intrinsic spin. This is purely a quantum mechanical effect. 
This too contributes to the magnetic dipole moment. 


In the last section, we defined the polarisation P to be the average dipole moment per 
unit volume. In analogy, we define the magnetisation M to be the average magnetic 
dipole moment per unit volume. Just as in the polarisation case, here “average” means 
averaging over atomic distances, but keeping any macroscopic variations of the polari- 
sation M(r). It’s annoyingly difficult to come up with simple yet concise notation for 
this. Pll choose to write, 


M(r) = n(m(r)) 


where n is the density of magnetic dipoles (which can, in principle, also depend on 
position) and the notation (-) means averaging over atomic distance scales. In most 
(but not all) materials, if there is no applied magnetic field then the different atomic 
dipoles all point in random directions. This means that, after averaging, (m) = 0 
when B = 0. However, when a magnetic field is applied, the dipoles line up. The 
magnetisation typically takes the form M « B. We’re going to use a slightly strange 
notation for the proportionality constant. (It’s historical but, as we’ll see, it turns out 
to simplify a later equation) 


1 m 
Wisc“ op (7.9) 
Hol +Xm 


where Xm is the magnetic susceptibility. The magnetic properties of materials fall into 


three different categories. The first two are dictated by the sign of Xm: 


e Diamagnetism: —1 < Xm < 0. The magnetisation of diamagnetic materials 
points in the opposite direction to the applied magnetic field. Most metals are 
diamagnetic, including copper and gold. Most non-metallic materials are also 
diamagnetic including, importantly, water with ym œ% —1075. This means, fa- 
mously, that frogs are also diamagnetic. Superconductors can be thought of as 
“perfect” diamagnets with Xm = —1. 


e Paramagnetism: Xm > 0. In paramagnets, the magnetisation points in the same 
direction as the field. There are a number of paramagnetic metals, including 
Tungsten, Cesium and Aluminium. 
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We see that the situation is already richer than what we saw in the previous section. 
There, the polarisation takes the form P = e9x-E with vy. > 0. In contrast, Xm can 
have either sign. On top of this, there is another important class of material that don’t 
obey (7.9). These are ferromagnets: 


e Ferromagnetism: M 4 0 when B = 0. Materials with this property are what you 
usually call “magnets”. They’re the things stuck to your fridge. The direction of 
B is from the south pole to the north. Only a few elements are ferromagnetic. 
The most familiar is Iron. Nickel and Cobalt are other examples. 


In this course, we won’t describe the microscopic effects that cause these different mag- 
netic properties. They all involve quantum mechanics. (Indeed, the Bohr-van Leeuwen 
theorem says magnetism can’t happen in a classical world — see the lecture notes on 
Classical Dynamics). A number of mechanisms for paramagetism and diamagnetism 
in metals are described in the lecture notes on Statistical Physics. 


7.2.1 Bound Currents 


In the previous section, we saw that when a material is polarised, it results in bound 
charge. There is a similar story here. When a material becomes magnetised (at least in 
an anisotropic way), there will necessarily be regions in which there is a current. This 
is called the bound current. 


Let’s first give an intuitive picture for where these bound currents appear from. 
Consider a bunch of equal magnetic dipoles arranged uniformly on a plane like this: 


M 
y Ste i= i= > 


The currents in the interior region cancel out and we’re left only with a surface current 


around the edge. In Section 3, we denoted a surface current as K. We’ll follow this 
notation and call the surface current arising from a constant, internal magnetisation 


KK pound: 


Now consider instead a situation where the dipoles are arranged on a plane, but have 
different sizes. We’ll put the big ones to the left and the small ones to the right, like 
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this: 
M 


J bound 


IEE 


In this case, the currents in the interior no longer cancel. As we can see from the 


picture, they go into the page. Since M is out of the page, and we’ve arranged things 
so that M varies from left to right, this suggests that Jpoung ~ V x M. 


Let’s now put some equations on this intuition. We know that the gauge potential 
due to a magnetic dipole is 
— pwomxr 
An r 


Integrating over all dipoles, and doing the same kinds of manipulations that we saw for 


A(r) 


the polarisations, we have 
M(r’) x (r —1’) 


A(r) = Po f Br 


An Jy Ir- r'|’ 
1 
= | r M(r’) x V’ | —— 
An Jy Ir-r] 
— _ Ho dS’ x M(r’) mf 3 iV x M(r’) 
An Jg r-r| 47 Jy Ir-r] 


Again, both of these terms have natural interpretation. The first can be thought of as 
due to a surface current 


Krouna =Mxû 


where 7 is normal to the surface. The second term is the bound current in the bulk 
of the material. We can compare its form to the general expression for the Biot-Savart 
law that we derived in Section 3, 


aot fer Be 
Ar r-r] 


We see that the bound current is given by 


J iene =VxM (7.10) 


as expected from our intuitive description above. Note that the bound current is a 
steady current, in the sense that it obeys V - Jbouna = 0. 
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7.2.2 Ampére’s Law Revisited 


Recall that Ampére’s law describes the magnetic field generated by static currents. 
We’ve now learned that, in a material, there can be two contributions to a current: 
the bound current Jpouna that we’ve discussed above, and the current Jee from freely 
flowing electrons that we were implicitly talking. In Section 3, we were implicitly 
talking about Js... when we discussed currents. Ampére’s law does not distinguish 
between these two currents; the magnetic field receives contributions from both. 


VxB= (odes = Jeanna) 
= LoS free T HoV xM 


We define the magnetising field, H as 


1 
H=—B-M (7.11) 
Ho 
This obeys 
Vx H= Je. (7.12) 


We see that the field H plays a similar role to the electric displacement D; the effect of 
the bound currents have been absorbed into H, so that only the free currents contribute. 
Note, however, that we can’t quite forget about B entirely, since it obeys V - B = 0. 
In contrast, we don’t necessarily have “V -H = 0”. Rather annoyingly, in a number of 
books H is called the magnetic field and B is called the magnetic induction. But this 
is stupid terminology so we won’t use it. 


For diamagnets or paramagnets, the magnetisation is linear in the applied magnetic 
field B and we can write 


B = pH 
A little algebra shows that u = uo(l + Xm). It is called the permeability. For most 
materials, u differs from fig only by 1 part in 10° or so. Finally, note that the somewhat 
strange definition (7.9) leaves us with the more sensible relationship between M and 


H, 


M = XmH 


=182= 


7.3 Macroscopic Maxwell Equations 


We've seen that the presence of bound charge and bound currents in matter can be 
absorbed into the definitions of D and H. This allowed us to present versions of Gauss’ 
law (7.7) and Ampére’s law (7.12) which feature only the free charges and free currents. 
These equations hold for electrostatic and magnetostatic situations respectively. In this 
section we explain how to reformulate Maxwell’s equations in matter in more general, 
time dependent, situations. 


Famously, when fields depend on time there is an extra term required in Ampére’s 
law. However, there is also an extra term in the expression (7.10) for the bound 
current. This arises because the bound charge, Ppouna, no longer sits still. It moves. 
But although it moves, it must still be locally conserved which means that it should 
satisfy a continuity equation 
_ OPvound 

ot 


From our earlier analysis (7.5), we can express the bound charge in terms of the polar- 


V. Jbouna = 


isation: Ppouna = —V - P. Including both this contribution and the contribution (7.10) 
from the magnetisation, we have the more general expression for the bound current 


oP 
J ound — VxM saa 
bound T DI 
Let’s see how we can package the Maxwell equation using this notation. We’re inter- 
ested in the extension to Ampére’s law which reads 
1 OE 


Boas. 
Ve c? Ot 


= Lod tree Ir Lod pound 
oP 
= Hod tree aI HoV xM+ Hoar 


As before, we can use the definition of H in (7.11) to absorb the magnetisation term. 


But we can also use the definition of D to absorb the polarisation term. We’re left 
with the Maxwell equation 
oD 
V x H — — = Jfree 
a `f 


The Macroscopic Maxwell Equations 


Let’s gather together everything we’ve learned. Inside matter, the four Maxwell equa- 
tions become 


V-D = pře and V XH dies 
B 
V-B=0 and vxE=-= (7.13) 
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There are the macroscopic Maxwell equations. Note that half of them are written in 
terms of the original E and B while the other half are written in terms of D and H. 
Before we solve them, we need to know the relationships between these quantities. In 
the simplest, linear materials, this can be written as 


D=cE and B=ywH 


Doesn’t all this look simple! The atomic mess that accompanies most materials can 
simply be absorbed into two constants, the permittivity € and the permeability u. Be 
warned, however: things are not always as simple as they seem. In particular, we’ll see 
in Section 7.5 that the permittivity € is not as constant as we’re pretending. 


7.3.1 A First Look at Waves in Matter 


We saw earlier how the Maxwell equations give rise to propagating waves, travelling 
with speed c. We call these waves “light”. Much of our interest in this section will be on 
what becomes of these waves when we work with the macroscopic Maxwell equations. 
What happens when they bounce off different materials? What really happens when 
they propagate through materials? 


Let’s start by looking at the basics. In the absence of any free charge or currents, 
the macroscopic Maxwell equations (7.13) become 


V-D=0 and vis 
Ot 
V:-B=0 and vxp=-% (7.14) 


which should be viewed together with the relationships D = «E and B = uH. But 
these are of exactly the same form as the Maxwell equations in vacuum. Which means 
that, at first glance, the propagation of waves through a medium works just like in 
vacuum. All we have to do is replace ceo —> € and uo — p. By the same sort of 
manipulations that we used in Section 4.3, we can derive the wave equations 


1 E 1 0?H 
v2 Ot? v2 ot? 


-V°H =0 


The only difference from what we saw before is that the speed of propagation is now 
given by 


— 184- 


This is less than the speed in vacuum: v? < œ. It’s common to define the index of 
refraction, n, as 
n=->1 7.15 
<> (7.15) 
In most materials, u ~ fo. In this case, the index of refraction is given in terms of the 
dielectric constant as 


n X Je, 


The monochromatic, plane wave solutions to the macroscopic wave equations take the 
familiar form 


E = Ep é and B = By e*t 
where the dispersion relation is now given by 
aa 
The polarisation vectors must obey Ey - k = Bo - k = 0 and 


_kxE 
p U 


Bo 


Boundary Conditions 


In what follows, we’re going to spend a lot of time bouncing waves off various surfaces. 
We'll typically consider an interface between two dielectric materials with different 
permittivities, €; and €. In this situation, we need to know how to patch together the 
fields on either side. 


Let’s first recall the boundary conditions that we derived in Sections 2 and 3. In 
the presence of surface charge, the electric field normal to the surface is discontinuous, 
while the electric field tangent to the surface is continuous. For magnetic fields, it’s the 
other way around: in the presence of a surface current, the magnetic field normal to the 
surface is continuous while the magnetic field tangent to the surface is discontinuous. 


What happens with dielectrics? Now we have two options of the electric field, E and 
D, and two options for the magnetic field, B and H. They can’t both be continuous 
because they’re related by D = «E and B = uH and we'll be interested in situation 
where e (and possibly u) are different on either side. Nonetheless, we can use the 
same kind of computations that we saw previously to derive the boundary conditions. 
Roughly, we get one boundary condition from each of the Maxwell equations. 
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Figure 63: The normal component of the Figure 64: The tangential component of 
electric field is discontinuous the electric field is continuous. 


For example, consider the Gaussian pillbox shown in the left-hand figure above. 
Integrating the Maxwell equation V - D = Pree tells us that the normal component of 
D is discontinuous in the presence of surface charge, 


where ñ is the normal component pointing from 1 into 2. Here ø refers only to the free 
surface charge. It does not include any bound charges. Similarly, integrating V -B = 0 


over the same Gaussian pillbox tells us that the normal component of the magnetic 
field is continuous, 


û - (Bə —B,) =0 (7.17) 


To determine the tangential components, we integrate the appropriate field around the 
loop shown in the right-hand figure above. By Stoke’s theorem, this is going to be 
equal to the integral of the curl of the field over the bounding surface. This tells us 
what the appropriate field is: it’s whatever appears in the Maxwell equations with a 
curl. So if we integrate E around the loop, we get the result 


Meanwhile, integrating H around the loop tells us the discontinuity condition for the 
magnetic field 


n x (Hə — Hı) = K (7.19) 
where K is the surface current. 


7.4 Reflection and Refraction 


We’re now going to shine light on something and watch how it bounces off. We did 
something very similar in Section 4.3, where the light reflected off a conductor. Here, 
we're going to shine the light from one dielectric material into another. These two 
materials will be characterised by the parameters €1, 4, and €2, u2. We’ll place the 
interface at x = 0, with “region one” to the left and “region two” to the right. 


= 186 = 


m 
N 
T 
N 


& u 


\ 
<< 
~~ 


4 


Uf 
x 


M 


7] 


\« 
a 


Figure 65: Incident, reflected and transmitted waves in a dielectric interface. 


We send in an incident wave from region one towards the interface with a frequency 
wr and wavevector kz, 


Vw 


Eine = E; el(krx—wit 
where 
k; = krcosôr $ + ky sin 0;z 


When the wave hits the interface, two things can happen. It can be reflected, or it can 
pass through to the other region. In fact, in general, both of these things will happen. 
The reflected wave takes the general form, 


Exet = Er cllkr-x—wet) 


where we’ve allowed for the possibility that the amplitude, frequency, wavevector and 
polarisation all may change. We will write the reflected wavevector as 


kpr = —krcos 0r X + krsinĝrŽ 


Meanwhile, the part of the wave that passes through the interface and into the second 
region is the transmitted wave which takes the form, 


Erans = Er p OPEN 
with 


kr = kr cos 0r X + kr sin 0r ĉ (7.20) 
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Again, we’ve allowed for the possibility that all the different properties of the wave 
could differ from the incoming wave. The electric field then takes the general form, 


Eine = Ere <0 
E = p 

Bigane T > 0 
All of this is summarised in the figure. 


We want to impose the matching conditions (7.16), (7.18), (7.19) and (7.17), with 
no surface charges and no surface currents. To start, we need the phase factors to be 
equal for all time. This means that we must have 


WI = WR = WT (7.21) 
and 
k;-x=kgręr:x=kr:x atzx=0 (7.22) 


This latter condition tells us that all of the wavevectors lie in the (x, z)-plane because 
kz originally lay in this plane. It further imposes the equality of the z components of 
the wavevectors: 


kr sin Or = kr sin Op S kr sin ôr (1:23) 


But, in each region, the frequency and wavenumbers are related, through the dispersion 
relation, to the speed of the wave. In region 1, we have wr = vikr and wr = vikr which, 
using (7.21) and (7.23), tells us that 


0; = Or 
This is the familiar law of reflection. 


Meanwhile, in region 2 we have wr = ugkr. Now (7.21) and (7.23) tell us that 


siny sinr 


vı U2 


In terms of the refractive index n = c/v, this reads 
nı sind; = nə sin dr (7.24) 


This is the law of refraction, known as Snell’s law. 
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Figure 66: Incident, reflected and transmitted waves with normal polarisation. 


7.4.1 Fresnel Equations 


There’s more information to be extracted from this calculation: we can look at the 
amplitudes of the reflected and transmitted waves. As we now show, this depends on 
the polarisation of the incident wave. There are two cases: 


Normal Polarisation: 


When the direction of Ez = Ey is normal to the (x, z)-plane of incidence, it’s simple 
to check that the electric polarisation of the other waves must lie in the same direction: 
Er = Ery and Er = Evry. This situation, shown in Figure 66, is sometimes referred 
to as s-polarised (because the German word for normal begins with s). 

The matching condition (7.18) requires 


Er + Er = Er 


Meanwhile, as we saw in (7.16), the magnetic fields are given by B = (k x E)/v. The 
matching condition (7.19) then tells us that 
Er— Er 


Er 
Brcosĝr — BreosOg = Breosér => —— cos6; = — cosôr 
Ul U2 


With a little algebra, we can massage these conditions into the expressions, 


Ep ny cos6; — ng cos Or d 
= an = 
Er nn, cos; + ng cos Or Er — n,cos@; + ng cos Op 


Er _ 2n: cos 67 


(7.25) 


These are the Fresnel equations for normal polarised light. We can then use Snell’s law 
(7.24) to get the amplitudes in terms of the refractive indices and the incident angle 
Or. 
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Figure 67: The reflected field with nor- Figure 68: The transmitted field with 
mal polarisation normal polarisation 


The most common example is if region 1 contains only air, with nı ~ 1, and region 
2 consists of some transparent material. (For example, glass which has nz ~ 1.5). The 
normalised reflected and transmitted fields are plotted in the figures above for nı = 1 
and nz = 2, with 0; plotted in degrees along the horizontal axis). Note that the vertical 
axes are different; negative for the reflected wave, positive for the transmitted wave. In 
particular, when 0 = 90°, the whole wave is reflected and nothing is transmitted. 


Parallel Polarisation: 


The case in which the electric field lies lies within the (x, z)-plane of incidence is 
sometimes referred to as p-polarised (because the English word for parallel begins with 
p). It is shown in Figure 69. Of course, we still require E; - k = 0, which means that 


E; = — Ersin br + E; cos6;z 


with similar expressions for Eg and Er. The magnetic field now lies in the +y direction. 
The matching condition (7.18) equates the components of the electric field tangential 
to the surface. This means 


Er cos 0; + Epcos 0p = Er cos dr 


while the matching condition (7.19) for the components of magnetic field tangent to 
the surface gives 


Er—Er Er 


U1 v2 


Br-Br=Br => 


where the minus sign for Br can be traced to the fact that the direction of the B field 
(relative to k) points in the opposite direction after a reflection. These two conditions 
can be written as 

Ep ny cos6p — nz cos 6; Er 2n, cos 6; 


E d = 7.26 
Er  n,cos@r + nz cos 6; di Er — ni cosôr + nz cos 8; ( ) 
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Figure 69: Incident, reflected and transmitted waves with parallel polarisation. 


These are the Fresnel equations for parallel polarised light. Note that when the incident 
wave is normal to the surface, so both 0; = 0r = 0, the amplitudes for the normal (7.25) 
and parallel (7.26) polarisations coincide. But in general, they are different. 


We can again plot the reflected and transmitted amplitudes in the case n; = 1 and 
Nz = 2, shown in the figure. 


Brewster’s Angle 


We can see from the left-hand figure that something interesting happens in the case of 

parallel polarisation. There is an angle for which there is no reflected wave. Everything 

gets transmitted. This is called the Brewster Angle, 0g. It occurs when nı cos ôr = 

nz cos6;. Of course, we also need to obey Snell’s law (7.24). These two conditions are 
only satisfied when 6; + 07 = 1/2. The Brewster angle is given by 

tan 0g = a 

nı 


For the transmission of waves from air to glass, 0g ~ 56°. 


Brewster’s angle gives a simple way to create polarised light: shine unpolarised light 
on a dielectric at angle 0g and the only thing that bounces back has normal polarisation. 
This is the way sunglasses work to block out polarised light from the Sun. It is also 
the way polarising filters work. 


7.4.2 Total Internal Reflection 


Let’s return to Snell’s law (7.24) that tells us the angle of refraction, 


‘ mi, 
sin ôr = — sin 0; 
n2 
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Figure 70: The reflected field with par- Figure 71: The transmitted field with 
allel polarisation parallel polarisation 


But there’s a problem with this equation: if ng > nı then the right-hand side can be 
greater that one, in which case there are no solutions. This happens at the critical 
angle of incidence, 0c, defined by 


For example, if light is moving from glass, into air, then 0¢ ~ 42°. At this angle, 
and beyond, there is no transmitted wave. Everything is reflected. This is called total 
internal reflection. It’s what makes diamonds sparkle and makes optical fibres to work. 


Here our interest is not in jewellery, but rather in a theoretical puzzle about how 
total internal reflection can be consistent. After all, we’ve computed the amplitude of 
the transmitted electric field in (7.25) and (7.26) and it’s simple to check that it doesn’t 
vanish when 6; = 0c. What’s going on? 


The answer lies back in our expression for the transmitted wavevector kp which 
we decomposed in (7.20) using geometry. The matching condition (7.22) tells us that 
kr: = 0 and 


R R WI . 
kp -%Z=ky-Z= H sin 6 
U1 
But, from the matching of frequencies (7.21), we know that wr = wr = w. We also 
know that the magnitude of the transmitted wavevector is given by |kr|? = w?/v3. But 
this means that the component of the wavevector in the x direction of propagation 


must be 
2 orn? 2 ain 
p - W vs sinf 0 wW nz sin“ 8 
kr: $ = +y/|kr|? — (kr 2} = +441 - 2 = 441 -2 
Və v? Və ns 
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We see that when nı sin r/n > 1, the X component of the wavevector is imaginary! 
We'll write kp - È = tiwa/v2. An imaginary wavevector sounds strange, but it’s very 
simple to interpret: we simply substitute it into our wave solution to find 


Eirans Z Er e(ikr-2—wt) erwax/ve zt>0 


Picking the minus sign in the exponent gives the physically sensible solution which 
decays as we move into region 2. We see that beyond the critical angle 0¢, there is no 
propagating wave in region 2. Instead it is replaced by a decaying solution. This is 
called an evanescent wave. 


As we'll now see, the idea that the wavevector can be imaginary is very useful in a 
many other circumstances. 


7.5 Dispersion 


The dielectric constant €, = €/€9 is poorly named. It is not constant. This is because, 
in the presence of time-dependent electric fields, the permittivity typically depends 
on the frequency: € = e(w). In this section, we will first provide a simple model to 
understand why this is the case and what form of e(w) we should expect. We’ll then 
move on to see the consequences of this frequency dependence. 


7.5.1 Atomic Polarisability Revisited 


In Section 7.1, we introduced a simple model for electric polarisability. This treats the 
atom as a point-like nucleus with charge q, surrounded by a cloud of electrons which 
we treat as a solid ball of radius a with uniform charge density. It’s obviously a daft 
model for the atom, but it will do for our purposes. 


Suppose that the centre of the electron cloud is displaced by a distance r. (You can 
equivalently think of the nucleus as displaced by the same distance in the opposite 
direction). We previously computed the restoring force (7.2) which acts on cloud, 

2 


q _ 2 
zr = —muwor 
Atreja 


F aoua Sa 


In the final equality, we’ve introduced the mass m of the cloud and defined the quantity 
wo which we will call the resonant frequency. 


In Section 7.1, we just looked at the equilibrium configuration of the electron cloud. 
Here, instead, we want to subject the atom to a time-dependent electric field E(t). In 
this situation, the electron cloud also feels a damping force 


F damping = =—myr (7.27) 
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for some constant coefficient y. You might find it strange to see such a friction term 
occurring for an atomic system. After all, we usually learn that friction is the effect 
of averaging over many many atoms. The purpose of this term is to capture the fact 
that the atom can lose energy, either to surrounding atoms or emitted electromagnetic 
radiation. If we now apply a time dependent electric field E(t) to this atom, the 
equation of motion for the displacement it 


më = —qE(t) — mugr — myi (7.28) 
Solutions to this describe the atomic cloud oscillating about the nucleus. 


The time dependent electric field will be of the wave form that 


we’ve seen throughout these lectures: E = Eye**?~-“) 


. However, the 
atom is tiny. In particular, it is small compared to the wavelength 


of (at least) visible light, meaning ka < 1. For this reason, we can 


ignore the fact that the phase oscillates in space and work with an 
electric field of the form E(t) = Ege~“’. Then (7.28) is the equation Fo «—— > 
for a forced, damped harmonic oscillator. We search for solutions to 

(7.28) of the form r(t) = roe~**. (In the end we will take the real Figure 72: 
part). The solution is 


qEo 1 
m —we + we — iyw 


rjy= = 


This gives the atomic polarisability p = aE, where 


2 
a= iim (7.29) 
—W* + Wo — yw 


As promised, the polarisability depends on the frequency. Moreover, it is also complex. 
This has the effect that the polarisation of the atom is not in phase with the oscillating 
electric field. 


Because the polarisability is both frequency dependent and complex, the permittivity 
€(w) will also be both frequency dependent and complex. (In the simplest settings, they 
are related by e(w) = ceo + na(w) where n is the density of atoms). We’ll now see the 
effect this has on the propagation of electromagnetic waves through materials. 


7.5.2 Electromagnetic Waves Revisited 


To start, we’ll consider a general form of the permittivity e(w) which is both frequency 
dependent and complex; we’ll return to the specific form arising from the polarisability 
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(7.29) later. In contrast, we will assume that the magnetic thing u is both constant 
and real, which turns out to be a good approximation for most materials. This means 
that we have 


D=c«w)E and B=yH 


We’ll look for plane wave solutions, so that the electric and magnetic fields takes the 
form 


E(x, t) = E(w) cilk-x—wt) and B(x, t) = B(w) cilk-x—wt) 
Maxwell’s equations in matter were given in (7.14). The first two simply tell us 


V-D=0 => e(w)k- E(w) =0 
V:-B=0 => k-Bi(w)=0 


These are the statements that the electric and magnetic fields remain transverse to the 
direction of propagation. (In fact there’s a caveat here: if e(w) = 0 for some frequency 
w, then the electric field need not be transverse. This won’t affect our discussion 
here, but we will see an example of this when we turn to conductors in Section 7.6). 
Meanwhile, the other two equations are 


Vx H= 2 => kx B(w) = —ue(w)wE(w) 
V x E= — =>  kxE(w)=wB(w) (7.30) 


We do the same manipulation that we’ve seen before: look at k x (k x E) and use the 
fact that k- E = 0. This gives us the dispersion relation 


k- k = pe(w)w? (7.31) 


We need to understand what this equation is telling us. In particular, e(w) is typically 
complex. This, in turn, means that the wavevector k will also be complex. To be 
specific, we’ll look at waves propagating in the z-direction and write k = kz. We’ll 
write the real and imaginary parts as 


elw) = e&l(w) + ieg(w) and k= kı + iko 
Then the dispersion relation reads 


ky + ikə = ORV ATV, €& + 1€5 (7.32) 
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and the electric field takes the form 
E(x, t) = E(w) e7** et(*12-#) (7.33) 


We now see the consequence of the imaginary part of e(w); it causes the amplitude of 
the wave to decay as it extends in the z-direction. This is also called attenuation. The 
real part, kı, determines the oscillating part of the wave. The fact that € depends on 
w means that waves of different frequencies travel with different speeds. We’ll discuss 
shortly the ways of characterising these speeds. 


The magnetic field is 


ip 
Hoyer 3 


F o 4N E(w) 


where ¢ = tan~'(k2/k 1) is the phase of the complex wavenumber k. This is the second 
consequence of a complex permittivity e(w); it results in the electric and magnetic fields 
oscillating out of phase. The profile of the magnetic field is 


k , 
B(x, t) = l(a x B(w)) e elle ort) (7.34) 
W 


As always, the physical fields are simply the real parts of (7.33) and (7.34), namely 
E(x, t) = E(w) e™?7 cos(kız — wt) 
k . 
B(x,t) = lkl ez x E(w)) e™?? cos(kız — wt + ¢) 
Ww 


To recap: the imaginary part of e means that k2 4 0. This has two effects: it leads to 
the damping of the fields, and to the phase shift between E and B. 


Measures of Velocity 


The other new feature of e(w) is that it depends on the frequency w. The dispersion 
relation (7.31) then immediately tells us that waves of different frequencies travel at 
different speeds. There are two, useful characterisations of these speeds. The phase 
velocity is defined as 


As we can see from (7.33) and (7.34), a wave of a fixed frequency w propagates with 
phase velocity vp(w). 
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Waves of different frequency will travel with different phase velocities v,. This means 
that for wave pulses, which consist of many different frequencies, different parts of the 
wave will travel with different speeds. This will typically result in a change of shape of 
the pulse as it moves along. We’d like to find a way to characterise the speed of the 
whole pulse. The usual measure is the group velocity, defined as 


_ de 
dk, 


Ug 


where we’ve inverted (7.31) so that we’re now viewing frequency as a function of (real) 
wavenumber: w(k,). 


To see why the group velocity is a good measure of the speed, let’s build a pulse by 
superposing lots of waves of different frequencies. To make life simple, we’ll briefly set 
€ = 0 and kı = k for now so that we don’t have to think about damping effects. Then, 
focussing on the electric field, we can build a pulse by writing 


Suppose that our choice of wavepacket E(k) is heavily peaked around some fixed 
wavenumber kg. Then we can expand the exponent as 


dw 
— — kot 
dk |, =i) 


—[w(ko) + vg (Kot + k[z — (ko) t] 


kz —w(k)t ~ kz —w(ko)t — 


The first term is just a constant oscillation in time; the second, k-dependent term is 
the one of interest. It tells us that the peak of the wave pulse is moving to the right 
with approximate speed v,(ko). 


Following (7.15), we also define the index of refraction 


This allows us to write a relation between the group and phase velocities: 


= | 
Up CW 


1 dkı d (=) ae! w dn 


Ug dw ~ dw Ve 


Materials with dn/dw > 0 have vg < vp; this is called normal dispersion. Materials 
with dn/dw < 0 have vy > vp; this is called anomalous dispersion. 
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Figure 73: The real part of the permit- Figure 74: The imaginary part of the 
tivity, €1 — €o permittivity, €2 


7.5.3 A Model for Dispersion 


Let’s see how this story works for our simple model of atomic polarisability a(w) given 
in (7.29). The permittivity is e(w) = €o + na(w) where n is the density of atoms. The 
real and imaginary parts € = €; + i€2 are 


nq w? — we 


€ = éo — 
j om (w= we)? + yw? 
ng? yw 


m (w? — we)? + yw? 


€&= 
These functions look like this: (These particular plots are made with y = 1 and wo = 2 
and nq?/m = 1). 


The real part is an even function: it has a maximum at w = wọ — y/2 and a minimum 
at w = wọ + 7/2, each offset from the resonant frequency by an amount proportional to 
the damping y. The imaginary part is an odd function; it has a maximum at w = wo, 
the resonant frequency of the atom. The width of the imaginary part is roughly 7/2. 


A quantity that will prove important later is the plasma frequency, wp. This is defined 
as 


2 
pe (7.35) 


a MEG 


We’ll see the relevance of this quantity in Section 7.6. But for now it will simply be a 
useful combination that appears in some formulae below. 


The dispersion relation (7.32) tells us 


k? = k? + Qiky ke = wule =F i€2) 
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Equating real and imaginary parts, we have 


1 1 M 
ky = +wy/H Gv e2 + 2 + ze) 
1 i 47 
kə = wyp Gre +e - 5) (7.36) 


To understand how light propagates through the material, we need to look at the values 
of kı and kə for different values of the frequency. There are three different types of 
behaviour. 


Transparent Propagation: Very high or very low frequencies 


The most straightforward physics happens when <€, > 0 and €; >> €9. For our simple 
model, this ocurs when w < wo — y/2 or when w > w,, the value at which €,(w,) = 0. 


Expanding to leading order, we have 


2 
ky S+w./ue, and kp & tw ee = | Ss ky < ky 
4e 2€1 


Because ky < kı, the damping is small. This means that the material is transparent 
at these frequencies. 


There’s more to this story. For the low frequencies, € > €9 + ng?/mw?. This is the 
same kind of situation that we dealt with in Section 7.3. The phase velocity vp < c in 
this regime. For high frequencies, however, €, < €o; in fact, €1(w) — €o from below as 
w — oo. This means that v, > c in this region. This is nothing to be scared of! The 
plane wave is already spread throughout space; it’s not communicating any information 
faster than light. Instead, pulses propagate at the group velocity, v,. This is less than 
the speed of light, vy < c, in both high and low frequency regimes. 


Resonant Absorption: w & wo 


Resonant absorption occurs when € > |e,|. In our model, this phenomenon is most 
pronounced when wọ >> y so that the resonant peak of €2 is sharp. Then for frequencies 
close to the resonance, w ~ wo + y/2, we have 


=s 


We see that we meet the requirement for resonant absorption if we also have wp Z wo. 
When € > |e;|, we can expand (7.36) to find 


ky & kg & w a 
2 
The fact that ky ~ kı means that the wave decays very rapidly: it has effectively 
disappeared within just a few wavelengths of propagation. This is because the frequency 
of the wave is tuned to coincide with the natural frequency of the atoms, which easily 
become excited, absorbing energy from the wave. 


Total Reflection: 


The third region of interest occurs when €, < 0 and |e;| >> €2. In our model, it is 
roughly for frequencies wo + y/2 < w < w,. Now, the expansion of (7.36) gives 


i lè 1 ue & [pl 
ky set ae eae ~ tw, | — 
ix taya (zlal+ apt gat) 7y Jal 


€ 
ky ~ wy ulel = lal, > ky 
2 


and 


Now the wavenumber is almost pure imaginary. The wave doesn’t even manage to get 
a few wavelengths before it decays. It’s almost all gone before it even travels a single 
wavelength. 


We're not tuned to the resonant frequency, so this time the wave isn’t being absorbed 
by the atoms. Instead, the applied electromagnetic field is almost entirely cancelled 
out by the induced electric and magnetic fields due to polarisation. 


7.5.4 Causality and the Kramers-Kronig Relation 
Throughout this section, we used the relationship between the polarisation p and ap- 
plied electric field E. In frequency space, this reads 

p(w) = a(w) E(w) (7.37) 


Relationships of this kind appear in many places in physics. The polarisability a(w) is 
an example of a response function. As their name suggests, such functions tell us how 
some object — in this case p — respond to a change in circumstance — in this case, the 
application of an electric field. 


= 200 = 


There is a general theory around the properties of response functions. The most 
important fact follows from causality. The basic idea is that if we start off with a 
vanishing electric field and turn it on only at some fixed time, tą, then the polarisation 
shouldn’t respond to this until after t,. This sounds obvious. But how is it encoded in 
the mathematics? 


The causality properties are somewhat hidden in (7.37) because we’re thinking of 
the electric field as oscillating at some fixed frequency, which implicitly means that it 
oscillates for all time. If we want to turn the electric field on and off in time then we 
need to think about superposing fields of lots of different frequencies. This, of course, 
is the essence of the Fourier transform. If we shake the electric field at lots of different 
frequencies, its time dependence is given by 


to dw ait 
E(t) = f on E(w) e 


OO 


where, if we want E(t) to be real, we should take E(—w) = E(w)*. Conversely, for a 
given time dependence of the electric field, the component at some frequency w is given 
by the inverse Fourier transform, 


E(w) = [- dt E(t) e™* 


Let’s now see what this tells us about the time dependence of the polarisation p. Using 
(7.37), we have 


a 2T 

+= q too mE 
-| Faw) f d mejet 

=o 20 oa 


= f dt &(t — t')E(t’) (7.38) 


where, in the final line, we’ve introduced the Fourier transform of the polarisability, 
T di ; 
vt) = — ae 7.39 
a= f alue (7.39) 
(Note that I’ve been marginally inconsistent in my notation here. I’ve added the tilde 


above & to stress that this is the Fourier transform of a(w) even though I didn’t do the 
same to p and E). 


5You can learn more about this in the Linear Response section of the lectures on Kinetic Theory. 
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Equation (7.38) relates the time dependence of p to the time dependence of the 
electric field E. It’s telling us that the effect isn’t immediate; the polarisation at time 
t depends on what the electric field was doing at all times t’. But now we can state the 
requirement of causality: the response function must obey 


att) =0:. for ¢< 0 


Using (7.39), we can translate this back into a statement mo 
about the response function in frequency space. When t < 
0, we can perform the integral over w by completing the 
contour in the upper-half plane as shown in the figure. Along Reo) 


the extra semi-circle, the exponent is —iwt — —oo for t < ‘ g 
0, ensuring that this part of the integral vanishes. By the 
residue theorem, the integral is just given by the sum of 
residues inside the contour. If we want a(t) = 0 for t < 0, we need there to be no poles. 
In other words, we learn that 


Figure 75: 


a(w) is analytic for Imw > 0 


In contrast, a(w) can have poles in the lower-half imaginary plane. For example, if you 
look at our expression for the polarisability in (7.29), you can see that there are two 
poles at w = —i7/2 + \/w? — 72/4. Both lie in the lower-half of the complex w plane. 


The fact that æa is analytic in the upper-half plane means that there is a relationship 
between its real and imaginary parts. This is called the Kramers-Kronig relation. Our 
task in this section is to derive it. We start by providing a few general mathematical 
statements about complex integrals. 


A Discontinuous Function 


First, consider a general function p(w). We’ll ask that p(w) is meromorphic, meaning 
that it is analytic apart from at isolated poles. But, for now, we won’t place any 
restrictions on the position of these poles. (We will shortly replace p(w) by a(w) which, 
as we’ve just seen, has no poles in the upper half plane). We can define a new function 
f(w) by the integral, 


iT w — w 


Hast f SAET (7.40) 


Here the integral is taken along the interval w’ € [a,b] of the real line. However, when 
w also lies in this interval, we have a problem because the integral diverges at w’ = w. 
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To avoid this, we can simply deform the contour of the integral into the complex plane, 
either running just above the singularity along w’ + ie or just below the singularity 
along w’ — ie. Alternatively (in fact, equivalently) we could just shift the position of 
the singularity to w + w +e. In both cases we just skim by the singularity and the 
integral is well defined. The only problem is that we get different answers depending 
on which way we do things. Indeed, the difference between the two answers is given by 
Cauchy’s residue theorem, 


[flu + te) — fw- io] = ple) (7.41) 


The difference between f(w+ie) and f(w—ie) means that the function f (w) is discontin- 
uous across the real axis for w € [a,b]. If p(w) is everywhere analytic, this discontinuity 
is a branch cut. 


We can also define the average of the two functions either side of the discontinuity. 
This is usually called the principal value, and is denoted by adding the symbol P before 
the integral, 


[flw + ie) + flw —te)] = =p | pe) du’ (7.42) 


iT W — w 


NI = 


We can get a better handle on the meaning of this principal part if we look at the real 
and imaginary pieces of the denominator in the integrand 1/[w’ — (w + ie)], 
1 Ww — w ie 


w—(wtie) (w -wte (w -we Geta 


The real and imaginary parts of this function are shown in the figures. 


We can isolate the real part by taking the sum of f(w + ie) and f(w — ie) in (7.42). 
It can be thought of as a suitably cut-off version of 1/(w’ — w). It’s as if we have 
deleted an small segment of this function lying symmetrically about divergent point w 
and replaced it with a smooth function going through zero. This is the usual definition 
of the principal part of an integral. 


Similarly, the imaginary part can be thought of as a regularised delta-function. As 
€ — 0, it tends towards a delta function, as expected from (7.41). 
Kramers-Kronig 


Let’s now apply this discussion to our polarisability response function a(w). We’ll be 
interested in the integral 


wER (7.44) 
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Figure 76: The real part of the function Figure 77: The imaginary part of the 
plotted with w’ = 1 and e = 0.5. function plotted with w’ = 1 and e = 0.5 


where the contour C skims just above the real axis, before closing at infinity in the 
upper-half plane. We’ll need to make one additional assumption: that a(w) falls off 
faster than 1/|w] at infinity. If this holds, the integral is the same as we considered in 
(7.40) with [a, b] — [—co, +00]. Indeed, in the language of the previous discussion, the 
integral is f(w — ie), with p = a. 


We apply the formulae (7.41) and (7.42). It gives 
1 +00 1 
et ere | dw’ aed — a(w) 


/ 
ie w! — w 


But we know the integral in (7.44) has to be zero since a(w) has no poles in the 
upper-half plane. This means that f(w — ie) = 0, or 


a(w) = ip D du’ ae) 


j / 
im T Ww — w 


The important part for us is that factor of “i” sitting in the denominator. Taking real 
and imaginary parts, we learn that 
tœ dus Im a(w! 
Rea(w) =P f e 
co T ww 
and 
t dus! Rea(w’) 
a T W Sw 


Ima(w) = -p | 


These are the Kramers-Kronig relations. They follow from causality alone and tell us 
that the imaginary part of the response function is determined in terms of the real 
part, and vice-versa. However, the relationship is not local in frequency space: you 
need to know Rea(w) for all frequencies in order to reconstruct Im a(w) for any single 
frequency. 
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7.6 Conductors Revisited 


Until now, we’ve only discussed electromagnetic waves propagating through insulators. 
(Or, dielectrics to give them their fancy name). What happens in conductors where 
electric charges are free to move? We met a cheap model of a conductor in Section 2.4, 
where we described them as objects which screen electric fields. Here we’ll do a slightly 
better job and understand how this happens dynamically. 


7.6.1 The Drude Model 


The Drude model is simple. Really simple. It describes the electrons moving in a 
conductor as billiard-balls, bouncing off things. The electrons have mass m, charge q 
and velocity v = t. We treat them classically using F = ma; the equation of motion is 


m— = qE — —v (7.45) 


The force is due to an applied electric field E, together with a linear friction term. This 
friction term captures the effect of electrons hitting things, whether the background 
lattice of fixed ions, impurities, or each other. (Really, these latter processes should 
be treated in the quantum theory but we’ll stick with a classical treatment here). The 
coefficient T is called the scattering time. It should be thought of as the average time 
that the electron travels before it bounces off something. For reference, in a good metal, 
T 107s. (Note that this friction term is the same as (7.27) that we wrote for the 
atomic polarisability, although the mechanisms behind it may be different in the two 
cases). 


We start by applying an electric field which is constant in space but oscillating in 
time 


E = E(w)e™ 


This can be thought of as applying an AC voltage to a conductor. We look for solutions 
of the form 


Plugging this into (7.45) gives 


(-iw $ z) v(w) = 2 E(w) 


The current density is J = nqv, where n is the density of charge carriers, so the solution 
tells us that 


J(w) = o(w)E(w) (7.46) 
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Figure 78: The real, dissipative part of | Figure 79: The imaginary, reactive part 
the conductivity of the conductivity 


This, of course, is Ohm’s law. The proportionality constant o(w) depends on the 
frequency and is given by 


Aga (7.47) 


It is usually referred to as the optical conductivity. In the limit of vanishing frequency, 
w = 0, it reduces to the DC conductivity, 


nq?r 


ODC = 


The DC conductivity is real and is inversely related to the resistivity p = 1/opc. In 
contrast, the optical conductivity is complex. Its real and imaginary parts are given by 


Opa Opc WT 
Reo(w) = IF gre and Im o(w) = ea) 
These are plotted for opc = 1 and 7 = 1: 


The conductivity is complex simply because we’re working in Fourier space. The real 
part tells us about the dissipation of energy in the system. The bump at low frequencies, 
w ~ 1/7, is referred to as the Drude peak. The imaginary part of the conductivity tells 
us about the response of the system. (To see how this is relevant note that, in the 
Fourier ansatz, the velocity is related to the position by v = r = —iwr). At very large 
frequencies, wT >> 1, the conductivity becomes almost purely imaginary, o(w) ~ i/wr. 
This should be thought of as the conductivity of a free particle; you’re shaking it so fast 
that it turns around and goes the other way before it’s had the chance to hit something. 


Although we derived our result (7.47) using a simple, Newtonian model of free elec- 
trons, the expression for the conductivity itself is surprisingly robust. In fact, it survives 
just about every subsequent revolution in physics; the development of quantum me- 
chanics and Fermi surfaces, the presence of lattices and Bloch waves, even interactions 
between electrons in a framework known as Landau’s Fermi liquid model. In all of 


— 206 — 


these, the optical conductivity (7.47) remains the correct answer®. (This is true, at 
least, at low frequencies, At very high frequencies other effects can come in and change 
the story). 


7.6.2 Electromagnetic Waves in Conductors 


Let’s now ask our favourite question: how do electromagnetic waves move through a 
material? The macroscopic Maxwell equations (7.14) that we wrote before assumed 
that there were no free charges or currents around. Now we’re in a conductor, we need 
to include the charge density and current terms on the right-hand side: 


D 
V:-D=p and VxH=3+2 
V-B=0 and VxE=- (7.48) 


It’s important to remember that here p refers only to the free charge. (We called it 
Pfree in Section 7.1). We can still have bound charge in conductors, trapped around the 
ions of the lattice, but this has already been absorbed in the definition of D which is 
given by 


D = e(w)E 
Similarly, the current J is due only to the free charge. 


We now apply a spatially varying, oscillating electromagnetic field, using the familiar 
ansatz, 


E(x, t) = E(w)e**) and B(x,t) = B(w)” (7.49) 


At this point, we need to do something that isn’t obviously allowed: we will continue 
to use Ohm’s law (7.46), even in the presence of a varying electric field, so that 


J(x, t) = o(w)E(w)eé t= (7.50) 


This looks dubious; we derived Ohm’s law by assuming that the electric field was the 
same everywhere in space. Why do we now get to use it when the electric field varies? 
For this to be valid, we need to assume that over the time scales 7, relevant in the 


6 As an extreme example, the conductivity of the horizon of certain black holes can be computed 
in general relativity. Even here, the result at low frequency is given by the simple Drude formula 
(7.47)! Details can be found in Gary Horowitz, Jorge Santos and David Tong, “Optical Conductivity 
with Holographic Lattices, arXiv:1204.0519. 
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derivation of Ohm’s law, the electric field is more or less constant. This will be true 
if the wavelength of the electric field, A = 27/|k] is greater than the distance travelled 
by the electrons between collisions. This distance, known as the mean free path, is 
given by l = (v)r, where v is the average speed. In most metals, | ~ 1077 m. (This is 
around 1000 lattice spacings; to understand how it can be so large requires a quantum 
treatment of the electrons). This means that we should be able to trust (7.50) for 
wavelengths \ > l ~ 1077 m, which is roughly around the visible spectrum. 


The continuity equation V - J + dp/dt = 0 tells us that if the current oscillates, then 
the charge density must as well. In Fourier space, the continuity equation becomes 


k- J F 
= — = ae) ke Rig) (7.51) 
w w 
We can now plug these ansatze into the Maxwell equations (7.48). We also need 
B = uH where, as previously, we’ll take u to be independent of frequency. We have 


V-D=p > i (dw) +i 


V-B=0 > k-BiWw)=0 


) k- E(w) =0 (7.52) 


As before, these tell us that the electric and magnetic fields are transverse to the 
direction of propagation. Although, as we mentioned previously, there is a caveat to 
this statement: if we can find a frequency for which e(w)+io(w)/w = 0 then longitudinal 
waves are allowed for the electric field. We will discuss this possibility in Section 7.6.3. 
For now focus on the transverse fields k-E = k - B = 0. 


The other two equations are 


VxH=J+ > => ikx B(w) = —ipw (e) + meh) E(w) 
VxE=-@ => kx E(w) =wBiu) 


The end result is that the equations governing waves in a conductor take exactly the 
same form as those derived in (7.30) governing waves in an insulator. The only differ- 


ence is that we have to make the substitution 
olw 
ew) — fw) = e(w) + a 


This means that we can happily import our results from Section 7.5. In particular, the 
dispersion relation is given by 


k- k = uef (w)w? (7.53) 
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Let’s now see how this extra term affects the physics, assuming that the optical con- 
ductivity takes the Drude form 

_ DC 

= l1—iwrT 


o(w) 


Low Frequencies 


At frequencies that are low compared to the scattering time, wr < 1, we have o(w) © 
cpc. This means that the real and imaginary parts of e° are 


ef — ett ett we +4 (e + Te) (7.54) 
w 


For sufficiently small w, we always have e$f >> e$f. This is the regime that we called res- 
onant absorption in Section 7.5. The physics here is the same; no waves can propagate 
through the conductor; all are absorbed by the mobile electrons. 


In this regime, the effective dielectric constant is totally dominated by the contribu- 
tion from the conductivity and is almost pure imaginary: e° ~ igpg/w. The dispersion 
relation (7.53) then tells us that the wavenumber is 


, - WO , 
k = kı + ik2 = Vipwopc = ae (1+%) 


So kı = kə. This means that, for a wave travelling in the z-direction, so k = kz, the 
electric field takes the form 


Ezi) = E(w)e~*/* oe) 


1 2 
ô — a 
ky V pwone 


The distance 6 is called the skin depth. It is the distance that electromagnetic waves 


where 


will penetrate into a conductor. Note that as w — 0, the waves get further and further 
into the conductor. 


The fact that kı = kp also tells us, through (7.34), that the electric and magnetic 
fields oscillate 7/4 out of phase. (The phase difference is given by tang = k2/k1). 
Finally, the magnitudes of the ratio of the electric and magnetic field amplitudes are 


IBw)| k [pone 
Elw] ow wW 


As w — 0, we see that more and more of the energy lies in the magnetic, rather than 
electric, field. 


given by 
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High Frequencies 
Let’s now look at what happens for high frequencies. By this, we mean both wr > 1, 


so that a(w) ~ iopc/wT and w > wy so that e(w) ~ co. Now the effective permittivity 
is more or less real, 


w2 


2 
eff AJ DC S Wp 
e (w) = €o — ga Téo ( — =) (7.55) 


where we are are using the notation of the plasma frequency w? = nq? /meo that we 
introduced in (7.35). What happens next depends on the sign of e°: 


e w > wp: At these high frequencies, e° > 0 and k is real. This is the regime of 
transparent propagation. We see that, at suitably high frequencies, conductors 
become transparent. The dispersion relation is w? = w? + c?k?. 


e w < wp: This regime only exists if wp > wo, 1/7. (This is usually the case). Now 
ef < 0 so k is purely imaginary. This is the regime of total reflection; no wave 
can propagate inside the conductor. 


We see that the plasma frequency w, sets the lower-limit for when waves can propagate 
through a conductor. For most metals, wp 1 ~ 107!°s with a corresponding wavelength 
of \, ~ 3 x 107’? m. This lies firmly in the ultraviolet, meaning that visible light is 
reflected. This is why most metals are shiny. (Note, however, that this is smaller than 
the wavelength that we needed to really trust (7.50); you would have to work harder 
to get a more robust derivation of this effect). 


There’s a cute application of this effect. In the upper atmosphere of the Earth, 
many atoms are ionised and the gas acts like a plasma with wp ~ 27 x 9 MHz. Only 
electromagnetic waves above this frequency can make it through. This includes FM 
radio waves. But, in contrast, AM radio waves are below this frequency and bounce 
back to Earth. This is why you can hear AM radio far away. And why aliens can’t. 
7.6.3 Plasma Oscillations 


We noted in (7.52) that there’s a get out clause in the requirement that the electric 
field is transverse to the propagating wave. The Maxwell equation reads 


V-D=p. = i(t) Bee) =0 


Which means that we can have k- E Æ 0 as long as f (w) = e(w) + io(w)/w = 0. 
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We could try to satisfy this requirement at low frequencies where the effective per- 
mittivity is given by (7.54). Since we typically have €; >> €z in this regime, this is 
approximately 


ei 


Koj 
w) & ey +i— 
Which can only vanish if we take the frequency to be purely imaginary, 


Ww = —t— 
E1 


This is easy to interpret. Plugging it into the ansatz (7.49), we have 
E(x, t) = E(w) e™®* e7700t/s 


which is telling us that if you try to put such a low frequency longitudinal field in a 
conductor then it will decay in time ~ €,;/opc. This is not the solution we’re looking 
for. 


More interesting is what happens at high frequencies, w >> 1/7, wo, where the effective 
permittivity is given by (7.55). It vanishes at w = wy: 


eF (wp) & 0 


Now we can have a new, propagating solution in which B = 0, while E is parallel to k. 
This is a longitudinal wave. It is given by 


E(x,t) = E(@, ea" = 8 


By the relation (7.51), we see that for these longitudinal waves the charge density is 
also oscillating, 


p(x, t) = k- E(w,)e*x-¥e) 
These are called plasma oscillations. 


Note that, while the frequency of oscillation is always wp, the wavenumber k can 
be anything. This slightly strange state of affairs is changed if you take into account 
thermal motion of the electrons. This results in an electron pressure which acts as a 
restoring force on the plasma, inducing a non-trivial dispersion relation. When quan- 
tised, the resulting particles are called plasmons. 


7.6.4 Dispersion Relations in Quantum Mechanics 


So far we’ve derived a number of dispersion relations for various wave excitations. In 
all cases, these become particle excitations when we include quantum mechanics. 
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The paradigmatic example is the way light waves are comprised of photons. These 
are massless particles with energy E and momentum p given by 


E=hw and p=hk (7.56) 


With this dictionary, the wave dispersion relation becomes the familiar energy-momentum 
relation for massless particles that we met in our special relativity course, 


w=ke > E=pc 


The relationships (7.56) continue to hold when we quantise any other dispersion re- 
lation. However, one of the main lessons of this section is that both the wavevector 
and frequency can be complex. These too have interpretations after we quantise. A 
complex k means that the wave dies away quickly, typically after some boundary. In 
the quantum world, this just means that the particle excitations are confined close to 
the boundary. Meanwhile, an imaginary w means that the wave dies down over time. 
In the quantum world, the imaginary part of w has the interpretation as the lifetime 
of the particle. 


7.7 Charge Screening 


Take a system in which charges are free to move around. To be specific, we’ll talk 
about a metal but everything we say could apply to any plasma. Then take another 
charge and place it at a fixed location in the middle of the system. This could be, for 
example, an impurity in the metal. What happens? 


The mobile charges will be either attracted or repelled by the impurity. If the impu- 
rity has positive charge, the mobile, negatively charged electrons will want to cluster 
around it. The charge of these electrons acts to cancel out the charge of the impurity 
so that, viewed from afar, the region around the impurity will appear to have greatly 
reduced charge. There is a similar story if the charge of the impurity is negative; now 
the electrons are repelled, exposing the lattice of positively charged ions that lies un- 
derneath. Once again, the total charge of a region around the impurity will be greatly 
reduced. This is the phenomenon of charge screening. 


Our goal here is to understand more quantitatively how this happens and, in par- 
ticular, how the effective charge of the impurity changes as we move away from it. As 
we'll see, ultimately quantum effects will result in some rather surprising behaviour. 
I should mention that, unlike other parts of these notes, this section will need results 
from both quantum mechanics and statistical mechanics. 
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7.7.1 Classical Screening: The Debye-Htickel model 


We’ll start by looking at a simple classical model for charge screening which will give 
us some intuition for what’s going on. Our metal consists of a mobile gas of electrons, 
each of charge q. These are described by a charge density p(r). In the absence of any 
impurity, we would have p(r) = po, some constant. 


The entire metal is neutral. The charges of the mobile electrons are cancelled by 
the charges of the ions that they leave behind, fixed in position in the crystal lattice. 
Instead of trying to model this lattice with any accuracy, we’ll simply pretend that it 
has a uniform, constant charge density —po, ensuring that the total system is neutral. 
This very simple toy model sometimes goes by the toy name of jellium. 


Now we introduce the impurity by placing a fixed charge Q at the origin. We want 
to know how the electron density p(r) responds. The presence of the impurity sets up 
an electric field, with the electrostatic potential @(r) fixed by Gauss’ law 


V? =- (Q5%(t) — po + ple) (7.57) 


Here the —po term is due to the uniform background charge, while p(r) is due to the 
electron density. It should be clear that this equation alone is not enough to solve for 
both p(r) and (r). To make progress, we need to understand more about the forces 
governing the charge distribution p(r). This sounds like it might be a difficult problem. 
However, rather than approach it as a problem in classical mechanics, we do something 


clever: we import some tools from statistical mechanics’. 


We place our system at temperature T. The charge density p(r) will be proportional 
to the probability of finding a charge q at position r. If we assume that there are no 
correlations between the electrons, this is just given by the Bolzmann distribution. The 
potential energy needed to put a charge q at position r is simply q@(r) so we have 


p(r) = po edo) /keT (7.58) 


where the normalisation po is fixed by assuming that far from the impurity ¢(r) > 0 
and the system settles down to its original state. 


See the lecture notes on Statistical Physics. The Debye-Hiickel model was described in Section 2.6 
of these notes. 
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The result (7.58) is a very simple solution to what looks like a complicated problem. 
Of course, in part this is the beauty of statistical mechanics. But there is also an 
important approximation that has gone into this result: we assume that a given electron 
feels the average potential produced by all the others. We neglect any fluctuations 
around this average. This is an example of the mean field approximation, sometimes 
called the Hartree approximation. (We used the same kind of trick in the Statistical 
Physics notes when we first introduced the Ising model). 


For suitably large temperatures, we can expand the Boltzmann distribution and write 


ne) (1-2...) 


Substituting this into Gauss’ law (7.57) then gives 


(v: E x oe) = - 25%) 


(7.59) 


We’ve written this in terms of the number density no of electrons instead of the charge 
density pọ = qno. The solution to this equation is 
Q e7"/à D 
o(r) = or arena (7.60) 
This equation clearly shows the screening phenomenon that we’re interested in. At 
short distances r < Xp, the electric field due to the impurity doesn’t look very much 
different from the familiar Coulomb field. But at larger distances r >> Ap, the screening 
changes the potential dramatically and it now dies off exponentially quickly rather than 
as a power-law. Note that the electrons become less efficient at screening the impurity 
as the temperature increases. In contrast, if we take this result at face value, it looks 
as if they can screen the impurity arbitrarily well at low temperatures. But, of course, 
the classical description of electrons is not valid at low temperatures. Instead we need 
to turn to quantum mechanics. 


7.7.2 The Dielectric Function 


Before we look at quantum versions of screening, it’s useful to first introduce some 
new terminology. Let’s again consider introducing an impurity into the system, this 
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time with some fixed charge distribution p™*(r), where “ext” stands for “external”. We 
know that, taken on its own, this will induce a background electric field with potential 


ext 
Vv? ext oF 
$ z 
But we also know that the presence of the impurity will affect the charge distribution 
of the mobile electrons. We’ll call p'™4(r) = p(r) — po the “induced charge”. We know 
that the actual electric field will be given by the sum of p™ and p™4, 


Vo= -E (™ (e) + p(n) 


This set-up is very similar to our discussion in Section 7.1 when we first introduced 
the idea of polarisation P and the electric displacement D. In that case, we were 
interested in insulators and the polarisation described the response of bound charge to 
an applied electric field. Now we’re discussing conductors and the polarisation should 
be thought of as the response of the mobile electrons to an external electric field. In 
other words, V-P = —pi™’. (Compare this to (7.5) for an insulator). Meanwhile, the 
electric displacement D is the electric field that you apply to the material, as opposed 
to E which is the actual electric field inside the material. In the present context, that 
means 


E=-Vọ and D=-eaVe™ 


When we first introduced E and D, we defined the relationship between them to be 
simply D = «E, where e is the permittivity. Later, in Section 7.5, we realised that e€ 
could depend on the frequency of the applied electric field. Now we’re interested in 
static situations, so there’s no frequency, but the electric fields vary in space. Therefore 
we shouldn’t be surprised to learn that € now depends on the wavelength, or wavevector, 
of the electric fields. 


It’s worth explaining a little more how this arises. The first thing we could try is 
to relate E(r) and D(r). The problem is that this relationship is not local in space. 
An applied electric field D(r) will move charges far away which, in turn, will affect the 
electric field E(r) far away. This means that, in real space, the relationship between D 
and E takes the form, 


D(r) = fer elr — r’)E(r’) (7.61) 


The quantity e(r —r’) is known as the dielectric response function. It depends only on 
the difference r — r’ because the underlying system is translationally invariant. This 
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relationship looks somewhat simpler if we Fourier transform and work in momentum 
space. We write 


dèk 
(27)? 
and similar expressions for other quantities. (Note that we’re using the notation in 


which the function and its Fourier transform are distinguished only by their argument). 
Taking the Fourier transform of both sides of (7.61), we have 


eikr D(k) 


D= fare De) e Do- f 


D(k) = fer e Dir) = pear far ee e(r — r’) e" Er’) 


But this final expression is just the product of two Fourier transforms. This tells us 
that we have the promised expression 


The quantity e(k) is called the dielectric function. The constant permittivity that we 
first met in Section 7.1 is simply given by e(k — 0). 


In what follows, we’ll work with the potentials ọ and charge densities p, rather than 
D and E. The dielectric function is then defined as 


o™ (k) = e(k)o(k) (7.62) 
We write ¢ = ®t + ¢4, where 
2 pind _ ps 2 pind/]-\ _ p™ (k) 
—V*e a => ko CS ae 


Rearranging (7.62) then gives us an expression for the dielectric function in terms of 
the induced charge p'™4 and the total electrostatic potential œ. 
1 o™t(k) 


e(k) =1— ak oil) (7.63) 


This will turn out to be the most useful form in what follows. 


Debye-Hiickel Revisited 


So far, we’ve just given a bunch of definitions. They’ll be useful moving forward, 
but first let’s see how we can recover the results of the Debye-Hiickel model using 
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this machinery. We know from (7.58) how the induced charge p'™¢ is related to the 
electrostatic potential, 


: r 
(r) = Po (e7990)/kaT = 1) y aian +... (7.64) 


To leading order, we then also get a linear relationship between the Fourier components, 


p(k) ~ -7 6(k) 


kgT 
Substituting this into (7.63) gives us an expression for the dielectric function, 
kp 
e(k) =1+ 7) (7.65) 


where k? = qpo/eokpT = 1/4, with Ap the Debye screening length that we introduced 
n (7.59). 


Let’s now see the physics that’s encoded in the dielectric function. Suppose that we 
place a point charge at the origin. We have 


Q 


Amer 


Q 


g"(k) = a 


g(r) = 


Then, using the form of the dielectric function (7.65), the resulting electrostatic poten- 
tial @ is given by 
o™(k) Q 


PU) = Te) eq(R? + BB) 


We need to do the inverse Fourier transform of ¢(k) to find g(r). Let’s see how to do 
it; we have 


= atk ekr Q k? eikr cos 0 
g(r) = / Ca (k) = On), nae la sino f dk Pie? 


where, in the second equality, we’ve chosen to work in spherical polar coordinates in 
which the k, axis is aligned with r, so that k -r = krcos@. We do the integrals over 
the two angular variables, to get 


dk k? : 2 sin kr 
0 ek. kr 
Q f. k sin kr 
Sn o k 

(277)€0r Jæ a k2 + k? 


Q i. dk kett" 
= Re a 
Qregr Zœ 2ri k? + k? 


= 
| 
3 
Ale 
fan) 
© 
; 
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We compute this last integral by closing the contour in the upper-half plane with 
k — +ioo, picking up the pole at k = +ikp. This gives our final answer for the 
electrostatic potential, 


Q e7t/AD 
plr) = -irer 

Tor 
That’s quite nice: we see that the dielectric function (7.65) contains the same physics 
(7.60) that we saw earlier in the direct computation of classical electrostatic screening. 
We could also compute the induced charge density to find 


Qe—7/” 


ind — 
a An d*,r 


which agrees with (7.64). 


But the dielectric function e(k) contains more information: it tells us how the system 
responds to each Fourier mode of an externally placed charge density. This means that 
we can use it to compute the response to any shape p**(r). 


Here, for example, is one very simple bit of physics contained in e(k). In the limit 
k — 0, we have e(k) + oo. This means that, in the presence of any constant, applied 
electric field D, the electric field inside the material will be E = D/e = 0. But you 
knew this already: it’s the statement that you can’t have electric fields inside conductors 
because the charges will always move to cancel it. More generally, classical conductors 
will effectively screen any applied electric field which doesn’t vary much on distances 
smaller than Ap. 


7.7.3 Thomas-Fermi Theory 


The Debye-Hiickel result describes screening by classical particles. But, as we lower the 
temperature, we know that quantum effects become important. Our first pass at this 
is called the Thomas-Fermi approximation. It’s basically the same idea that we used in 
the Debye-Htickel approach, but with the probability determined by the Fermi-Dirac 
distribution rather than the classical Boltzmann distribution. 


We work in the grand canonical ensemble, with temperature T and chemical potential 
u. Recall that the probability of finding a fermion in a state |k} with energy Ep is given 
by the Fermi-Dirac distribution 


1 
f(k) = el(Er- p) /kBT 4 | 


(7.66) 


= 218 = 


The chemical potential u is determined by the requirement that the equilibrium charge 
density is p(u) = po, where 


dk q 
pl) = gs f Gays eE Tg] (7.67) 


Here g, is the spin degeneracy factor which we usually take to be g, = 2. 


Let’s now place the external charge density p***(r) in the system. The story is the 
same as we saw before: the mobile charges move, resulting in an induced charge density 
p™{(r), and a total electrostatic potential ¢(r). The Thomas-Fermi approximation 
involves working with the new probability distribution 


1 
Mkr) = e 4] 


(7.68) 


This can be thought of as either changing the energy to E = E,+q¢@(r) or, alternatively, 
allowing for a spatially varying chemical potential u > u — qọ(r). 


The first thing to say about the probability distribution (7.68) is that it doesn’t make 
any sense! It claims to be the probability for a state with momentum k and position 
r, yet states in quantum mechanics are, famously, not labelled by both momentum 
and position at the same time! So what’s going on? We should think of (7.68) as 
an approximation that is valid when ¢(r) is very slowly varying compared to any 
microscopic length scales. Then we can look in a patch of space where ¢(r) is roughly 
constant and apply (7.68). In a neighbouring patch of space we again apply (7.68), 
now with a slightly different value of ¢(r). This idea of local equilibrium underlies the 
Thomas-Fermi (and, indeed, the Debye-Hiickel) approximations. 


Let’s see how this works in practice. The spatially dependent charge density is now 
given by 


oe ak q 7.69 
p(T; LL) = Qs (27)? e(Ek+aġ(r)-u)/kBT +1 l l ) 


We’re interested in computing the induced charge density p'™4(r) = p(r) — po. Combin- 
ing (7.69) and (7.67), we have 


ind (Y) a Ëk q 7 q 
d a (27)3 |eErtaglr)-u)/keT 41 e(ÆEr-u)/kBT 4] 
But we can rewrite this using the notation of (7.67) simply as 


p(n) 
om 


P(r) = p (u — 99(r)) — plu) ~ —ad(r) 
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where, in the last step, we have Taylor expanded the function which is valid under 
the assumption that qé(r) < u. But this immediately gives us an expression for the 
dielectric function using (7.63), 


Op q 
k) =1+ ——-; 
el ) T ðu Eok? 
We’re almost there. We still need to figure out what Op/Op is. This is particularly easy 
if we work at T = 0, where we can identify the chemical potential u with the Fermi 
energy: u = Ep. In this case, the Fermi-Dirac distribution is a step function and the 
total charge density is simply given by 


p(Er) = af dE g(E) 


where g(F) is the density of states (we’ll remind ourselves what form the density of 
states takes below). We learn that 0p/OEpr = g(Ep) and the dielectric function is given 
by 
2 
E 
e(k)=1+ goer) (7.70) 
cok 
Note that the functional form of e(k) is exactly the same as we saw in the classical case 
(7.65). The only thing that’s changed is the coefficient of the 1/k? term which, as we 
saw before, determines the screening length. Let’s look at a simple example. 


A Simple Example 


For non-relativistic particles, the energy is given by E = h?k?/2m. In three spatial 
dimensions, the density of states is given by® 


This is kind of a mess, but there’s a neater way to write g(r). (This neater way 
will also allow for a simple comparison to the Debye screening length as well). At zero 
temperature, the total charge density is 


Ep 
po = af dE g(E) 
0 


8See the lecture notes on Statistical Physics for details on how to compute the density of states. 
The g(£) we use here differs slightly from that presented in the Statistical Physics lectures because it 
does not include an overall volume factor. This is because we want to compute the number density of 
particles rather than the total number of particles. 
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Using this, we have 


and we can write the dielectric function as 


ktr 
where kŻp = 3qpo/2¢9E pr. This is our expression for the Thomas-Fermi screening length 
ATR = L/krp. 


It’s instructive to compare this screening length with the classical Debye length Xp. 
We have 


where Tp = kgEr is the Fermi temperature. The classical analysis can only be trusted 
at temperature T >> Tr where Ap > Arr. But, for metals, the Fermi temperature 
is hot; something like 10*,K. This means that, at room temperature, T < Tp and 
our quantum result above (which, strictly speaking, was only valid at T = 0) is a 
good approximation. Here Ap < Arr. The upshot is that quantum mechanics acts to 
increase the screening length beyond that suggested by classical physics. 


7.7.4 Lindhard Theory 


The Thomas-Fermi approximation is straightforward, but it relies crucially on the po- 
tential ¢(r) varying only over large scales. However, as we will now see, the most 
interesting physics arises due to variations of ¢(r) over small scales (or, equivalently, 
large k). For this we need to work harder. 


The key idea is to go back to basics where, here, basics means quantum mechanics. 
Before we add the impurity, the energy eigenstates are plane waves |k) with energy 
E(k) = h?k?/2m. To determine the dielectric function (7.63), we only need to know 
how the mobile charge density p(r) changes in the presence of a potential (r). We can 
do this by considering a small perturbation to the Hamiltonian of the form 


AH = q¢(r) 


The energy eigenstate that is labelled by k now shifts. We call the new state |v(k)). 
Ultimately, our goal is to compute the induced charge density. For an electron in state 
|w(k)), the probabilty of finding it at position r is simply |(r|w(k))|?.. Which means 
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that, for this state, the change in the density is |(r|q(k))|? — |(r|k)|?.. The induced 
charge density p™4(r) is obtained by summing over all such states, weighted with the 
Fermi-Dirac distribution function. We have 

3 


p(n) = 49g an f(k) [[Orleb(k)) 2 = als) 


where f(k) is the Fermi-Dirac distribution (7.66) and we’ve remembered to include the 
spin degeneracy factor g, = 2. To make progress, we need to get to work computing 
the overlap of states. 


To first order in perturbation theory, the new energy eigenstate is given by 


podlo) = Ik) + f SST Ik 


Keeping only terms linear in AH, we can expand this out to read 


BK le (AHIR) 
(27)? E(k) — E(k’) 


But we have expressions for each of these matrix elements. Of course, the plane waves 


(k’|A Hk) 


fi) na) 


(r|k’) 


(rl)? — [delle [? = f 


take the form (r|k) = e™", while the matrix elements of the perturbed Hamiltonian 
are 


(k'|q@(r)|k) = J PraPr! Mt) (r go(r)|r) = gb(k — k’) 


In other words, it gives the Fourier transform of the electrostatic potential. Putting 
this together, we arrive at an integral expression for the induced charge, 


p(n) = tan f CRE pay [ee oe 
(Qa)? (27)? E(k) — E(k’) E(k) — E(k’) 
Of course, what we really want for the dielectric function (7.63) is the Fourier transform 
of the induced charge, 


p(k = per er) 


Thankfully, doing the f d?r integral gives rise to a delta-function which simplifies our 
life rather than complicating it. Performing some relabelling of dummy integration 
variables, we have 


p(k) 5 f BR, 1 | 1 
my 8 | apt COREN "apj akrep) VM 
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These two terms are more similar than they look. If we change the dummy integration 
variable in the first term to k’ — k’ + k then we can write 
p(k) _ > f ak f(k +k'|)- f(k’) 
olk) J (27)? E(k +kK])- E(k’) 
The left-hand side is exactly what we want. The right-hand side is an integral. It’s 
not too hard to do this integral, but let’s first check that this result gives something 


(7.72) 


sensible. 


Thomas-Fermi Revisited 


Let’s first see how we can recover the Thomas-Fermi result for the dielectric function. 
Recall that the Thomas-Fermi approximation was only valid when the potential ¢(r), 
and hence the induced charge p™¢(r), vary slowly over large distances. In the present 
context, this means it is valid at small k. But here we can simply Taylor expand the 
numerator and denominator of (7.72). 


E(|k+k’|) — E(k’) = ak k 
Of OE 
and f(lk +k!) so) m SES 
So we have 
gk) =, 2 dk! Of 7, Of 
wy 8 | Gap an © | A M2 5p 


where the last step is essentially the definition of the density of states g( E&E). But at T = 
0, the Fermi-Dirac distribution f(E) is just a step function, and Of /OE = —ô(E — Erp). 
So at T = 0, we get 
p™ (k) ¢°9(Er) 
o(k) Eok? 


which we recognise as the Thomas-Fermi result (7.70) that we derived previously. 


= q’°g(Er) = e(k) = + 


The Lindhard Function 


While the Thomas-Fermi approximation suffices for variations over large scales and 
small k, our real interest here is in what happens at large k. As we will now show, 
quantum mechanics gives rise to some interesting features in the screening when impu- 
rities have structure on scales of order ~ 1/kp where kp is the Fermi-wavevector. For 
this, we need to go back to the Lindhard result 
p(k) _ > f ak f(k +k']) — f(k’) 

p(k) J (27)? E(|k + k’|) — E(k’) 
Our task is to do this integral properly. 
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a) k<2kp b) k=2kp c) k>2kp 


Figure 80: The two Fermi surfaces in momentum space. The integration region © is shown 
shaded in red for a) k < 2kp, b) k = 2kp and c) k > 2Qkp. 


Let’s firstly get a sense for what the integrand looks like. We’ll work at T = 0, so 
the Fermi-Dirac distribution function f(k) is a step function with 


1 k < kf 
k) = 


This makes the integral much easier. All the subtleties now come from figuring out 
which region in momentum space gives a non-vanishing contribution. The filled states 
associated to f(k’) form a ball in momentum space of radius kp, centered at the origin. 
Meanwhile, the filled states associated to f(|k’ + k|) form a ball in momentum space 
of radius kp centered at k’ = —k. These are shown in a number of cases in Figure 80. 
Because the integral comes with a factor of f(|k + k’|) — f(k’), it gets contributions 
only from states that are empty in one ball but filled in the other. We call this region 
X; it is the shaded red region shown in the figures. There is a also a mirror region 
in the other ball that also contributes to the integral, but this simply gives an overall 
factor of 2. So we have 


p(k) E dk’ 1 
de) n s Qn) E(k+k) E) 


The important physics lies in the fact that the nature of X changes as we vary k. For 
k < 2kp, 4 is a crescent-shaped region as shown in Figure 80a. But for k > 2kp, X is 
the whole Fermi ball as shown in Figures 80b and 80c. 


We’ll work with non-relativistic fermions with E = h?k?/2m. While the graphical 
picture above will be useful to get intuition for the physics, to do the integral it’s 
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actually simpler to return to the form (7.71). At zero temperature, we have 


p(k) J dK 1 | 1 
dk) TS Jeep, OTP | =k? 2k- Kk! | kh? — 2k! 
> 2 2m ark 2 
E qos h? k'<kp (27)? k2 = 2k! 2 k 


where the two terms double-up because rotational symmetry ensures that the physics 
is invariant under k + —k. Now the integration domain remains fixed as we vary 
k, with the graphical change of topology that we saw above buried in the integrand. 
For k < 2kp, the denominator in the integrand can vanish. This reflects the fact 
that transitions between an occupied and unoccupied state with the same energy are 
possible. It corresponds to the situation depicted in Figure 80a. But for k > 2kp, the 
denominator is always positive. This corresponds to the situation shown in Figure 80c. 


To proceed, we work in polar coordinates for k’ with the z-axis aligned with k. We 


have 
ind 2 T kp 12 
Aa F dð sind | ay —— 
0 0 k 


o(k) (2m 27 2 — 2kk' cos 0 
_ 2mq?gs 1 Be ces k? + 2kk' 
~ rh k / ON METZ 


But this is now an integral that we can do; the general form is 


ay +b by Lf ge P ay +b 
dy yl = eee | ss 
[ex vio (2+5) ani rd a —ay +b 
We then have 
ind (x 9 29, 1 k 1 2 
p(k) mag ka (1-5) tog 


Qkkp + k? 
olk) (20 2h2k —Qkkp + k? 


This gives our final expression, known as the Lindhard dielectric function, 


ae k 


where all the constants that we gathered along our journey sit in k?,, = q?g(Er)/€o = 


2 2 


4 


gs’ mk p/2n7h?e9. This is the Thomas-Fermi wave result that we saw previously, but 
now it is dressed by the function 


2 


1 1 
F(x) =-—-4 log 
2 x 


etl 
x—-1 


At small k we have F(x — 0) = 1 and we recover the Thomas-Fermi result. 
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For variations on very small scales, we’re interested in the large k regime where 
x — œ and F(x) > 1/32”. (You have to go to third order in the Taylor expansion of 
the log to see this!). This means that on small scales we have 


Akt. pki 
3k4 


e(k) 21+ 
However, the most interesting physics occurs at near k = 2kp. 


7.7.5 Friedel Oscillations 


We saw above that there’s a qualitative difference in the accessible states when k < 2kp 
and k > 2k. Our goal is to understand what this means for the physics. The dielectric 
function itself is nice and continuous at k = 2kp, with F(x = 1) = 1/2. However, it is 
not smooth: the derivative of the dielectric function suffers a logarithmic singularity, 


1 x—-1 
F’ 17 =] 
(z > )> 5 tos ( 7 ) 


This has an important consequence for the screening of a point charge. 


As we saw in Section 7.7.2, a point charge gives rise to the external potential 


Q 


Eok? 


g™(k) = 


and, after screening, the true potential is ¢(k) = $%*(k)/e(k). However, the Fourier 
transform back to real space is now somewhat complicated. It turns out that it’s easier 
to work directly with the induced charge density p™4(r). From the definition of the 
dielectric function (7.63), the induced charge density in the presence of a point charge 
p% (k) = Q/ek? is given by, 


where, for k ~ 2kp, we have 


e(k)-1 kbp k—2kp.  (k—2Qkp 
= ía — 
e(k) 8k ke N ere) 


Now we want to Fourier transform this back to real space. We repeat the steps that 
we took in Section 7.7.2 for the Debye-Hiickel model to get 


mosa f See (M) Bt fm (MM) at 
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At this stage, it’s useful if we integrate by parts twice. We have 


marc Q 1 fa 3 kek) HR. 5 
p ‘= a S, dk al eK) ) sin 


Of course, the Fourier integral requires us to know ¢(k) at all values of k, rather than 


just around k = 2k;. Suppose, however, that we’re interested in the behaviour a long 
way from the point charge. At large r, the sinkr factor oscillates very rapidly with 
k, ensuring that the induced charge at large distances is essentially vanishing. This 
was responsible for the exponential behaviour of the screening that we saw in both the 
Debye-Hiickel and Thomas-Fermi models. However, at k = 2k the other factor in the 
integrand diverges, 


d? (ke(k) —k 7 kapo 1 

dk? e(k) ~ 4k? k — Qkp 

This will now give rise to a long-range contribution. Therefore, if we only care about 
this long-distance behaviour, we need only integrate over some small interval J about 
k = 2kp, 


i Oral sin kr 
pir) a ore fa 


8n2k2, r3 k — 2kr 
_ Qkîr 1 J cos(2kpr) sin((q — 2kr)r) _ sin(2kpr)cos((k — 2kr)r) 
— 8rkèr3 Jy k — 2kp k — 2kp 


where we’ve used a little trigonometry. The second term above vanishes on parity 
grounds (contributions from either side of k = kp cancel). We can approximate the 
first term by extending the range of the integral to all k (because, as we’ve just argued, 
the main contribution comes from the interval J anyway). Using pe dx sin e/a =r, 
we get our final expression for the long-distance charge density induced by a point 
charge, 


inda Qkîr cos(2krr) 
PS 8rk2, r3 


We learn that the effect of the Fermi surface is to dramatically change the screening 


(7.74) 


of electric charge. Instead of the usual exponential screening, we instead find a power- 
law fall off, albeit weaker than the Coulomb force in vacuum (i.e. 1/r? instead of 1/r). 
Moreover, the sign of the induced charge oscillates. These are called Friedel oscillations. 
They provide a very visual way to see the edge of the Fermi surface. This figure shows 


9 


some Friedel oscillations on a two-dimensional surface”. You can make out a bright 


°The figure is taken from Direct Observation of Friedel Oscillations around Incorporated Sica 
Dopants in GaAs by Low-Temperature Scanning Tunneling Microscopy by M van der Wielen, A van 
Roij and H. van Kempen, Physical Review Letters 76, 7 (1996). 
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central region, surrounded by a black ring, surrounded by another white ring. This 


corresponds to a Fermi wavelength of around Àp ~ 1078 m. 


Heuristically, what’s going on here is that the wave- 
function of the electrons has a finite size.. At zero tem- 
perature, the states with lowest energy have wavelength 
\ = 1/kp. These modes enthusiastically cluster around 
the impurity, keen to reduce its charge but, unaware 
of their own cumbersome nature, end up overscreening. 
Other electrons have to then respond to undo the dam- 
age and the story is then repeated, over exuberance piled 
upon over exuberance. The end result is a highly inef- 
ficient screening mechanism and the wonderful rippling 
patterns of charge that are seen in scanning tunnelling 
microscopes. 
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Figure 81: Friedel oscilla- 
tions in GaAs doped with Sil- 
icon. 


